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Abstract: We develop a novel approach to phase transitions in quantum spin models 
based on a relation to their classical counterparts. Explicitly, we show that whenever 
chessboard estimates can be used to prove a phase transition in the classical model, the 
corresponding quantum model will have a similar phase transition, provided the inverse 
temperature /3 and the magnitude of the quantum spins S satisfy f3 <C VS. From the 
quantum system we require that it is reflection positive and that it has a meaningful clas- 
sical limit; the core technical estimate may be described as an extension of the Berezin- 
Lieb inequalities down to the level of matrix elements. The general theory is applied to 
prove phase transitions in various quantum spin systems with S ^> 1. The most notable 
examples are the quantum orbital-compass model on Z 2 and the quantum 120-degree 
model on 1? which are shown to exhibit symmetry breaking at low-temperatures de- 
spite the infinite degeneracy of their (classical) ground state. 
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1. Introduction 

It is considered common knowledge that, for spin systems, the behavior of a quantum 
model at finite temperature is "like" the behavior of the corresponding classical model. 
However, beyond the level of heuristics, it is far from clear in what sense the above 
statement is meaningful. Another, slightly more academic way to "recover" the clas- 
sical spin system is to consider spin-representations with spin-magnitude 5 and then 
let 5 — > oo. A standard argument as to why this should work is that the commutators 
between various spin operators are order- 1/5 smaller than the quantities themselves, 
and so the spins behave essentially classically when 5 is large. Notwithstanding, pre- 
cise statements along these lines have only been made for the 5 — > oo limit of the free 
energies [4,27,28,36,45] and specific types of 1/5 corrections [12,38,39]. 

A common shortcoming of the above studies is that neither spells explicit conditions 
on the relative magnitude of (3 and 5 for which the classical behavior is exhibited. This 
is of importance because, at sufficiently low temperatures, the relevant excitations are 
quantum. For example, while the classical Heisenberg antiferromagnet on a finite bi- 
partite graph has a continuum of ground states (related by the SO(3) symmetry), the 
quantum Heisenberg antiferromagnet has a unique ground state [37]. Another example 
is the 1 1 1 -interface in the classical Ising model which, at zero temperature, is disor- 
dered but may be stabilized by appropriate (but arbitrarily small) quantum perturba- 
tions [9, 32]. The control of the relevant quantum excitations is a non-trivial subject and 
is usually accomplished only when finite-temperature effects are of little significance 
for the overall behavior. 

The preceding discussion is particularly important for systems which undergo phase 
transitions. Here several techniques have been available — infrared bounds [20, 26], 
chessboard estimates [23-25,33] and contour expansions [10, 13, 14,35] — some of 
which (specifically, the latter two) are more or less based on the assumption that the 
quantum system of interest has a strong classical component. However, while cer- 
tain conclusions happen to apply uniformly well even as 5 — > oo, the classical ref- 
erence state of these techniques is usually discrete (e.g., Ising type). This is quite unlike 
the 5 — > oo limit which inherently leads to a continuous-spin, Heisenberg-like model. 
Thus, the relation between the above "near-classical" techniques and the 5 — > oo results 
discussed in the first paragraph is tenuous. 

The purpose of this paper is to provide a direct connection between the 5 — > oo ap- 
proach to the classical limit of quantum spin systems and the proofs of phase transitions 
by the traditional means of chessboard estimates. Explicitly, we establish the following 
general fact: Whenever chessboard estimates can be used to prove a phase transition in 
the classical system, a corresponding transition will occur in the quantum system pro- 
vided \/5 is sufficiently larger than the inverse temperature. This permits us to prove 
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phase transitions in systems with highly degenerate ground states, but without contin- 
uous symmetry, as well as certain temperature driven phase transitions which have not 
been accessible heretofore. 

To highlight the main idea of our approach, let us recall how chessboard estimates 
enter the proofs of phase transitions. Suppose a quantum system on the torus is parti- 
tioned into disjoint blocks and a projector on a "bad event" is applied in some of the 
blocks. The goal is to show that the expectation — in the quantum Gibbs state — of the 
product of these projectors decays exponentially with the number of bad blocks. Here 
the chessboard estimates offer a non-trivial simplification: The expectation to the in- 
verse number of bad blocks is maximized by the configuration in which all blocks are 
bad. In classical models, the latter quantity — sometimes referred to as the universal con- 
tour — is often fairly easy to estimate by properly accounting for energy and entropy of 
the allowed configurations. However, this is not the case once quantum effects get into 
play; the only general technique that has been developed for this purpose is the "prin- 
ciple of exponential localization" [25] which hinges on an approximate diagonalization 
of the "universal projectors" and model-specific spectral estimates. 

The main feature of our approach is that we bound the (relevant) universal contours 
directly — namely, by the universal contours for the classical (i.e., S = oo) version of 
the quantum system. The technical estimate making this possible is a new bound on the 
matrix element of the Gibbs-Boltzmann weight relative to coherent states \fl), which 
is close in the spirit to the celebrated Berezin-Lieb inequalities [4,36]. The result is that 
(f}\t~ f3H \Q) is dominated by the classical Gibbs-Boltzmann weight times a correction 
that is exponential in 0(P/VS) x volume. Hence, if /3 <C \/S, the exponential growth- 
rate of partition functions, even those constrained by various projectors, is close to that 
of the classical system. This is ideally suited for an application of chessboard estimates 
and the corresponding technology — developed in [23-25, 33] — for proving first-order 
phase transitions. Unfortunately, the bound in terms of universal contour has to be per- 
formed before the "conversion" to the classical setting and so we still require that the 
quantum system is reflection positive. 

To showcase our approach, we provide proofs of phase transitions in the following 
five quantum systems (defined by their respective formal Hamiltonians): 

(1) The anisotropic Heisenberg antiferromagnet: 

H = + S~ 2 (JiS*S*, + J 2 S y r S y r , + S z r S z r ,) (1.1) 

(r ,r') 

where < Ji, J 2 < 1. 

(2) The non-linear XY-model: 

H = - E y( 5 ^'+ 5 " s " ) (1.2) 

<r,r') ^ ' 

where 7{x) = T\(x 2 ) ± x7 2 (x 2 ) for two polynomials 7i, 7 2 (of sufficiently high 
degree) with positive coefficients. 

(3) The non-linear nematic model: 

H = - ^2 V(S- 2 (S r ■ S r/ ) 2 ) (1.3) 

<r,r') 

where S r ■ S r > = S*S*,+SyS y rl +S z r S*, and where 7 is a polynomial— typically 
of high degree — with positive coefficients. 
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(4) The orbital compass model on 1?: 



(5) The 120-degree model on Z 3 : 




C — 2 QX QX 

s- 2 sys v r „ 



if r' = r ± e 



if r ' = r ± e 



(1.4) 



fl" = T r 3 ?' if r'=r± Bj 



(1.5) 



where 



°r J 



if j = 1 



if j = 2 



(1.6) 



_ 1 oi _ ot/ 



if j = 3. 



Here (r , r ') denotes a nearest-neighbor pair on Z d — where unless specified we are only 
assuming d > 2 — the symbol Bj stands for the unit vector in the j-th lattice direction 
and S r = (S* , S% , S* ) is a triplet of spin-5 operators for the spin at site r . The scaling 
of all interactions by the indicated inverse powers of S is necessary to make the S — > oo 
limit meaningful. 

Model (1) has been included only for illustration; the requisite transition was proved 
for large anisotropy [25] and, in the context of the ferromagnet (which is not even re- 
flection positive), for arbitrarily small anisotropy [31]. The classical versions of models 
(2-4) feature strong order-disorder transitions at intermediate temperatures; cf [1, 16, 
22, 33]. Here we will prove that corresponding transitions occur for large-5 quantum 
versions of these systems. Models (4-5) are quite unusual even at the classical level: 
Notwithstanding the fact that the Hamiltonian has only discrete symmetries, there is a 
continuum of ground states. As was shown in [6,7], at positive temperatures the de- 
generacy is lifted leaving only a finite number of preferential directions. The proofs 
of [6,7] involve (classical) spin-wave calculations not dissimilar to those of [18, 19]. 
However, since the massless spin-wave excitations are central to the behavior of these 
systems — even at the classical level — it is by no means clear how to adapt the methods 
of [10, 13, 14, 20, 23-25, 31, 33, 35] to these cases. 

The remainder of the paper is organized as follows: In the next section, we recall 
the formalism of coherent states, which is the basis of many S — > oo limit results, 
and the techniques of reflection positivity and chessboard estimates, which underline 
many proofs of phase transitions in quantum systems. In Sect. 3 we state our main 
theorems; the proofs come in Sect. 4. Applications to the various phase transitions in 
the aforementioned models are the subject of Sect. 5. The Appendix (Sect. 6) contains 
the proofs of some technical results that would detract from the main line of argument 
in Sects. 5.3-5.5. 

2. Preliminaries 

In this section, we summarize standard and well-known facts about the SU(2) coherent 
states (Sect. 2.1) and the techniques of chessboard estimates (Sect. 2.2). The purpose of 
this section is mostly informative; a reader familiar with these concepts may skip this 
section altogether and pass directly to the statement of main results in Sect. 3. 
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2.1. Coherent states. Here we will recall the Bloch coherent states which were the basis 
for rigorous control of various classical limits of quantum spin systems [4,27,28,36,45]. 
In a well defined sense, these states are the "closest" objects to classical states that 
one can find in the Hilbert space. Our presentation follows closely Lieb's article [36]; 
some of the calculations go back to [3]. The theory extends to general compact Lie 
groups, see [17,45] for results at this level of generality. The literature on the subject 
of coherent states is quite large; we refer to, e.g., [2,42] for comprehensive review and 
further references. 

Given S € {V2, 1, %, • • • }, consider the (2S + l)-dimensional irreducible represen- 
tation of the Lie algebra su(2). The generators, (S x , S y , S z ), obeying the commutation 
rules [S l , S^} = 2ieijkS k , are operators acting onspan{|M) : M = —S, S+l, ... ,S — 
1, <S} ~ C 25+1 . In terms of spin-rasing/lowering operators, S ± = S x ± iS y , we have 

S Z \M) =M\M), 

S+\M) = ^S(S + 1) - M(M + 1) |M + 1), (2.1) 

S~\M) = y/S(S + 1) - M(M - 1) \M - 1). 

In particular, S x and S z are real while S y is purely imaginary. 

The classical counterpart of su(2)-spins are vectors on the two-dimensional unit 
sphere S^i in K 3 . For each J? <G ^2, one defines the coherent state vector in the direction 
fl to be 

\ Q )= E ( S + m) ^[cosM^tsin^^-^e'^-^IM). (2.2) 

M=-5 ^ ' 

Here (9, <p) are the spherical coordinates of Q, with 9 denoting the azimuthal angle and 
4> denoting the polar angle. Let ( — tan( e / 2 )e 1 ^ denote the stereographic projection 
from ,5^2 to C. Then (2.2) can be written as 

\Q) - e^^ s+ \S) = [l + \C\ 2 ]- S ^ \S) 

= [cos(e/ 2 )} 2S exp(tan(«/ 2 )e^5-) \S) . (2.3) 

One important property of the coherent state | Q) is that it is an eigenvector of the matrix 
J? • S with maximal eigenvalue: 

{Q-S)\Q) = S\f2). (2.4) 

This equation characterizes the vector | Q) up to a phase factor. The choice of the phase 
factors may seem arbitrary, but in practice they will cancel in all the formulas we use. 

The fact that the states \Q) have been defined relative to the basis in (2.1) is incon- 
sequential. Indeed, a rotation of a coherent state is, to within a harmless phase factor, 
the coherent state corresponding to the rotated vector. More precisely, for each lo e y 2 
and tel, one may consider the unitary U Ui t — e'^^'^. Then, for any Q e ^2, a 
simple calculation shows that 

U„, t (f2-S)U+ t = RutM- S, (2.5) 

where R^,t € SO(3) is the rotation about the ray passing through lo by the angle t. 
Because of this U u .t\fi) satisfies (2.4) with Q replaced by _R wt (i7) and so 
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for some phase factor /(I?, u, t). Since SU(2) is a double cover of SO(3), f(fi, u, 2ir) 
is not necessarily (mod 2ir); rather s l f( n ^< 2 ^) = (-1) 25 . 
The explicit formula (2.2) for \fi) yields 

(fi'\fi) = [cos(*/ 2 ) cos( e '/ 2 ) + S^-^ sin(*/ 2 ) si<'/ 2 )] 25 (2.6) 
Defining the angle between fi and fi' to be 0, one also has 

\(fi'\fi)\ = [cos(©/ 2 )] 2S . (2.7) 
Another formula that is directly checked from (2.2) is 

2S +1 f 

1 = — — / dn\n)(n\, (2.8) 

where dJ? denotes the uniform surface measure on S% with total mass 4ir. 

Given any operator A on C 25+1 , one can form what is commonly known as the 
lower symbol, which is a function fi (A) q defined by 

(A) n ■■= (f2\A\f2). (2.9) 

(Here and henceforth, ( fi\A\ fi) denotes the inner-product of | fi) with the vector A\ fi) .) 
While not entirely obvious, it turns out that the trace of A admits the formula 

o c i 1 r 

Tr(A) = — — / dfi(A) n . (2.10) 

There is also a generalization of (2.8): There exists a function fi [A\q such that 

o c I i r 

A = —— Afi[A] S2 \fi)(fi\. (2.11) 

Any such fi [A] q is called an upper symbol for A. Unfortunately, such a function is 
not unique and so [A] q actually represents an equivalence class of functions. Obviously 
{A + B)q = (A) f2 + (-E?)f2. For the upper symbols, if [A]q and [B] q are upper symbols 
for A and B then [A + B\q = [A]q + [B]q is an upper symbol for A + B. 

When A = 1, one has = 1 and, by (2.8), one can also choose [A]q = 1. 

However, it is usually not the case that the lower symbol is also an upper symbol, e.g., 
we have 

{S x ) a = S sm6 COS0, [S x ] n = (S + 1) sin<9 cos0, 

(S v ) n = S sin6 sin0, [S y ] n = (S + 1) sine sin^ (2.12) 

(S z )n - S cose, [S z } n = (S + 1) cosfl. 

As is easily checked, the leading order in S of these expressions is exactly the classical 
counterpart of the corresponding operator. For more complicated products of the spin 
components, both symbols develop lower-order "non-classical" corrections but, as was 
shown in [17, Theorem 2], the leading order term is always the classical limit. 

The above formalism generalizes to collections of many spins. Let A be a finite set 
and, for each r E A, let (5* , S* 2 , ) be the spin operator for the spin at r . We will as- 
sume that the spins at all sites have magnitude S, so we assume to have a joint (product) 
representation of these spins on Ha = <S) r e/iP 2 ' S+1 ]»" Consider an assignment of a 



Quantum spin systems at positive temperature 



7 



classical spin Q r e y 2 to each r E A and denote the resulting configuration (f2 r ) r eA 
by fl. The desired product coherent state then is 

\0) := (g)|^r). (2.13) 

r eA 

Given an operator A on Ha, we define its lower symbol by the generalization of (2.9), 
(A) n = {f2\A\f2), ne(y 2 ) lAl - (2.14) 
With this lower symbol we may generalize (2.10) into 

Tr HA (A) = f^r^V ' / dH (A) n . (2.15) 



47T 



(^2)1^1 



There is also a representation of A in terms of an upper symbol [^4]^, 

^=(^^J y r ...drt[A] fl |tf></?|. (2.16) 



47T 



'(^2) 



where dJ? is the product surface measure on (=5*2 and where i— > [A]^ is now a 
function (^ 2 ) |y11 — > C. A special case of this formula is the resulution of the identity 
on W^. Note that (2.16) allows us to substitute [A] n for (A) n in (2.15). 

It is easy to check that fi i— > [A] ^ has the expected behavior under (tensor) prod- 
uct of operators, provided these respect the product structure of Ha- Indeed, suppose 
that A is the disjoint union of A\ and A 2 and let \ fl\) and \fl 2 ) be product coherent 
states from Ha x and Ha 2 > respectively. Given two operators A\ : Ha x — * and 
A 2 : W/t 2 — > Ha 2 > l et [^iJj?! and [A 2 ]^ 2 t> e their associated upper symbols. Then 

[Ai®A 2 ] [i2l j i2) := [A x ] ni [A 2 ]n 2 (2.17) 

is an upper symbol of A\ ® A 2 relative to state |(J?i, i? 2 )) = ® On the other 
hand, if [A] a depends only on {Q r ) re A' where A' C ^ then we can perform a partial 
trace in (2. 16) by integrating over the (Q r ) r eA^.A' and applying (2.8) for each integral. 



2.2. Chessboard estimates. Next we will review the salient features of the technology 
of reflection positivity/chessboard estimates which was developed and applied to both 
classical and quantum systems in the works of F. Dyson, J. Frohlich, R. Israel, E. Lieb, 
B. Simon and T. Spencer [20,23-26]. 

Consider a C*-algebra 21 and suppose that 21+ and 2l_ are commuting subalgebras 
which are "mirror images" of each other in the sense that there is an algebraic automor- 
phism 9 : 21 — > 21 such that #(2l±) = 21+ and 9 2 = id. Assuming that 21 is represented 
in terms of complex matrices, for A e 21 we define A to be the complex conjugate — not 
the adjoint — of A. We will always assume that 21 is closed under complex conjugation. 
Note that, since complex conjugation is not a "covariant operation," the representation 
of 21 ought to stay fixed throughout all calculations involving complex conjugation. 

A relevant example of the above setting is a quantum spin-5 system on the d- 
dimensional torus Tj, of L x • • • x L sites, with L even, which we think of as a union 
of two disjoint symmetric halves, and T£. (Note that can also be identified 
with Z d /LZ d . Of course the origin e Z d maps to the origin of the torus.) Then 21 is 



8 



M. Biskup, L. Chayes and S. Starr 



the C*-algebra of all observables — represented by (2S + 1)I Tl I dimensional complex 
matrices — and 2l± are the sets of observables on T^, respectively. Explicitly, 2t+ are 
matrices of the form A + ® 1 , where A + "acts" only on T^, while the matrices in 2l_ 
take the form 1 ® The operation 9 is the map that interchanges the "left" and "right" 
half of the torus; e.g., in a properly parametrized basis, 8(A + ® 1 ) — 1 <E> A + . The fact 
that 9 arises from a reflection leads to the following concept: 

Definition 2.1. Let {—) be a state — i.e., a continuous linear functional — on 21 and let 9 

be as above. We say that (— } is reflection positive (relative to 9) if for all A,B e 2l + , 



(A0(B)) = (B0(A)) (2.18) 

and 

(A9{A))>0. (2.19) 

The following condition, derived in [20, Theorem E.l] and in [25, Theorem 2.1], is 
sufficient for the Gibbs state to have the above property: 

Theorem 2.2 (Reflection positivity — sufficient condition). Given a reflection of Tl 
as described above and using 9 to denote the associated reflection operator, if the 
Hamiltonian of a quantum system on Tl can be written as 



H = C + 9(C) - J g(da) D a 6{D a ), (2.20) 

where C, D a € 21+ and g is a (finite) positive measure, then the canonical Gibbs state 
(— ) L p, which is defined by 

Tr„ T (e-P R A) 

^ ■ (2 - 21) 

is reflection positive relative to 9 for all > 0. 

The crux of the proof of (2.19) is the fact that the — state is generalized reflec- 
tion positive, i.e., (A\9(Ai) . . . A n 9(A n )) £,o > 0. The rest follows by a Lie-Trotter 
expansion of eT^ 11 into powers of the last term in (2.20) — hence the need for a minus 
sign in front of the integral. 

Remark 2. 3. We reiterate that the reflections of Tl considered here are always for 
"planes of reflections" between sites. In classical models one can also consider the 
(slightly more robust) reflections for "planes" on sites. However, due to non-commu- 
tativity issues, Theorem 2.2 does not seem to generalize to quantum systems for these 
kinds of reflections. 

Reflection positivity has two important (and related) consequences: Gaussian domi- 
nation — leading ultimately to infrared bounds — and chessboard estimates. In this work 
we make no use of the former; we proceed by discussing the details of the latter. 

Let Ab be a block of B x • • • x B sites with the "lower-left" corner at the ori- 
gin. Assuming that L is a multiple of B, we can tile by disjoint translates of Ab- 
The positions of these translates are given by _B-multiples of vectors t from the factor 
torus T L / B . In particular, if A B + r denotes the translate of Ab by r G Tl, then 
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is the disjoint union {J te j L b (Ab + Bt). Let %a b denote the algebra of observables 
in Ab, i.e., each A G ^Ab has the form A = As ® 1 , where As acts only on the por- 
tion of the Hilbert space corresponding to Ab - For each A G 21a b and each t G T^/b 
with \t\ = 1, we can define an antilinear operator $t(A) in Ab + Bt by 



MA) = 0(A) (2.22) 

where 6 is the operator of reflection along the corresponding side of Ab- By taking 
further reflections, we can define i>t{A) for every t G T L / B . (Thus i?t is linear for 
even-parity t and antilinear for odd-parity t; if every component of t is even then i? t 
is simply the translation by Bt) It is easy to check that the resulting $t(A) does not 
depend on what sequence of reflections has been used to generate it. 

The fundamental consequence of reflection positivity, derived in a rather general 
form in [25, Theorem 2.2], is as follows: 

Theorem 2.4 (Chessboard estimate). Suppose that the state (—) is reflection positive 
for any "plane of reflection" between sites on Tj,. Then for any A\, . . . , A m G %Ia b 
and any distinct vectors t\ , . . . , t m G T L / B , 

I m \ ml \ ( B / L ) d 

( n^(^-))< n ( n <m^)) ^ 

By (2.23) we may bound the expectation of a product of operators by product of ex- 
pectations of so called "disseminated" operators. As we will show on explicit examples 
later, these are often easier to estimate. Note that the giant products above can be written 
in any order by our assumption that the block-operators in different blocks commute. 

A corresponding statement works also for classical reflection-positive measures. 
The only formal difference is that the Afs are replaced by functions, or indicators 
of events Aj, which depend only on the spin configuration in A B . Then equation (2.23) 
becomes 

, m v m . s (B/L) d 

p Pima) <n p n ■ (2 - 2 ^ 

Here 9t(A) is the (usual) reflection of A to the block A B + Bt. (We reserve the sym- 
bol #t {A) for an operation that more closely mimics i} t in the coherent-state represen- 
tation; see the definitions right before Proposition 3.4.) Refs. [5, 6, 8] contain a detailed 
account of the above formalism in the classical context; the original statements are, of 
course, due to [23-25]. 

Remark 2.5. Unlike its classical counterpart, the quantum version of reflection positiv- 
ity is a rather mysterious concept. First, for most of the models listed in the introduction, 
in order to bring the Hamiltonian to the form (2.20), we actually have to perform some 
sort of rotation of the spins. (We may think of this as choosing a different representation 
of the spin operators.) The purpose of this operation is to have all spins "represented" 
by real-valued matrices, while making the overall sign of the interactions negative. This 
permits an application of Theorem 2.2. 

It is somewhat ironic that this works beautifully for antiferromagnets, which thus 
become effectively ferromagnetic, but fails miserably [47] for genuine ferromagnets. 
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For XY-type models, when only two of the spin-components are involved in the inter- 
action, we can always choose a representation in which all matrices are real valued. If 
only quadratic interactions are considered (as for the nematics) the overall sign is in- 
consequential but, once interactions of different degrees are mixed — even if we just add 
a general external field to the Hamiltonian — reflection positivity may fail again. 



3. Main results 

We now give precise statements of our main theorems. First we will state a bound 
on the matrix elements of the Gibbs-Boltzmann weight in the (overcomplete) basis of 
coherent states. On the theoretical side, this result generalizes the classic Berezin-Lieb 
inequalities [4, 36] and thus provides a more detailed demonstration of the approach to 
the classical limit as S — > oo. On the practical side, the bound we obtain allows us to 
replace the "exponential localization" technique of Frohlich and Lieb [25] — which is 
intrinsically quantum — by an estimate for the classical version of the model. 

The rest of our results show in detail how Theorem 3.1 fits into the standard line of 
proof of phase transitions via chessboard estimates. In Sect. 5 we will apply this general 
strategy to the five models of interest. 



3.1. Matrix elements of Gibbs-Boltzmann weights. We commence with a definition of 
the class of models to which our arguments apply. Consider a finite set A C 7L d and, for 
each r C A, let h r be an operator on Ha = ® re yi[C 2,s+1 ]r that depends only on the 
spins in r. (I.e., hr is a tensor product of an operator on and the unity on H.A^r-) 
We will assume that hr = if the size of r exceeds some finite constant, i.e., each 
interaction term involves only a bounded number of spins. The Hamiltonian is then 



Most of the interesting examples are such that hr = unless r is a two point set {x, y} 
containing a pair of nearest neighbors on Z d — as is the case of all of the models (1-5) 
discussed in Sect. 1. 

As already noted, our principal technical result is a bound on the matrix element 
(Q\e~^ H \fi > ). To state this bound precisely, we need some more notation. Let Q i— > 
[/it]j2 be an upper symbol of the operator hr which, by (2.17), may be assumed inde- 
pendent of the components (Q r ) x gr- We^zx the upper symbol of H to 



We will also use \r\ to denote the number of elements in the set r and \\hr\\ to denote 
the operator norm of hr on Ha- 

Let \Q r — Q' r \ denote the (3-dimensional) Euclidean distance of the points £2 r 
and Q' r on 5^2, and consider the following i 1 and £ 2 -norms on (J^)' 71 ': 




(3.1) 



r-. r<zA 




(3.2) 



r ■. r<zA 




(3.3) 



r£A 
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and 

\\Q-Q'\\ 2 = [Y,\ Q r -^| 2 ) 2 - (3.4) 

Besides these two norms, we will also need the "mixed" quantity 

d s (n,Q')= J2(^\^r-n' r \^S\n r -Q' r \ 2 ), (3.5) 

r eA 

where A denotes the minimum. This is not a distance function but, as will be explained 
in Lemma 4.2, it does satisfy an inequality which could be compared to the triangle 
inequality. Finally, from (2.7) we know that | (Q r | Q' r ) \ = 1 - 0{S\Q r - Q' r | 2 ). Hence, 
there is ij > such that 

| (flltf) | < e -vs\\n-n'\\l (36) 

holds for all S, all Q, J?' <E (=5^)'^' and all A. We fix this r\ throughout all forthcoming 
derivations. (Since [cos^A)] 2 = 1 — V4II rt — 0'\\ 2 for a single spin, we have 77 = l U. 
But rj plays only a marginal role in our calculations so we will leave it implicit.) Our 
first main theorem then is: 



Theorem 3.1. Suppose that there exists a number R such that 

\r\ > R h r = 0, (3.7) 

and that, for some constants cq and C\ independent of S and A, we have 

sup Yj IMI<co (3.8) 
xeA r-.xercA 

as well as the Lipschitz bound 

\[hr]n - [hr]n>\ < ci||/2- fi'||i||/i r ||, r c A. (3.9) 

Then for any constant C2 > 0, there exists a constant C3 > 0, depending only on c , C\, 
C2 and R, such that for all (3 < C2VS, 

\{Q\^ H \Q')\ < e -m\n-^s{OM')+c 3 (3\A\/VS (3 1Q) 

holds for all !7, Q' e (^2) |A| and all finite A. 

Note that we do not assume that the Hamiltonian is translation-invariant. In fact, as 
long as the conditions (3.7-3.9) hold as stated, the geometry of the underlying set is 
completely immaterial. For the diagonal elements — which is all we need in the subse- 
quent derivations anyway — the above bound becomes somewhat more transparent: 

Corollary 3.2. Suppose (3.7-3.9) hold and let c 2 and C3 be as in Theorem 3.1. Then for 
all (3 andS with (3 < c 2 \A5, all fl e (J^ 2 ) |A| and all A, 

e -P(H)a < (n\e-f> H \f2) < e -fm\n+c 3 (3\A\/VS (3 n) 
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It is interesting to compare this result with the celebrated Berezin-Lieb inequali- 
ties [4, 36] which state the following bounds between quantum and classical partition 
functions: 



(An unpublished proof of E. Lieb, cf [46], shows both inequalities are simple conse- 
quences of Jensen's inequality; the original proof [36] invoked also the "intrinsically 
non-commutative" Golden-Thompson inequality.) From Corollary 3.2 we now know 
that, to within a correction of order (3/VS, the estimates corresponding to (3.12) hold 
even for the (diagonal) matrix elements relative to coherent states. However, the known 
proofs of (3.12) use the underlying trace structure in a very essential way and are not 
readily extended to a generalization along the lines of (3. 1 1). 

Remarks 3.3. Some comments are in order: 

(1) The correction of order @\A\/VS is the best one can do at the above level of gener- 
ality. Indeed, when fl and J7' are close in the sense 1 1 i? — i?' 1 1 1 = O ( | yl | / \J~S) , then 
[H] q and [H] n 1 differ by a quantity of order c\ \ A\ / \/~S. Since the matrix element is 
symmetric in Q and J?', the bound must account for the difference. However, there 
is a deeper reason why (3/VS needs to be small for the classical Boltzmann weight 
to faithfully describe the matrix elements of the quantum Boltzmann weight. Con- 
sider a single spin with the Hamiltonian H — S~ 1 S Z , and let Q correspond to the 
spherical angles (0, <p). A simple calculation shows that then 



The term (3 cos 9 is the (now unambiguous) classical interaction in "state" Q. The 
leading correction is of order f3 2 /S, which is only small if (3 <C VS. 
(2) Another remark that should be made, lest the reader think about optimizing over 
the many choices of upper symbols in (3.10): The constant C3 depends on the upper 
symbol. For hp being a polynomial in spin operators, [hr] may be chosen a poly- 
nomial too [17, Proposition 3]. This automatically ensures properties such as the 
Lipschitz continuity (as well as existence of the classical limit, cf (3.14)). For more 
complex hr's — e.g., those defined by an infinite power series — one must carefully 
check the conditions (3.7-3.9) before Theorem 3.1 can be applied. 



3.2. Absence of clustering. Our next task is to show how Theorem 3.1 can be applied to 
establish phase transitions in models whose (S — ► 00) classical version exhibits a phase 
transition that can be proved by means of chessboard estimates. The principal conclu- 
sion is the absence of clustering which, as we will see in Sect. 3.3, directly implies a 
quantum phase transition. 

Consider the setting as described in Sect. 2.2, i.e., we have a torus Tl of side L 
which is tiled by {L/B) d disjoint translates of a block Ab of side B. For each operator 
in Ab and each t e T L / B , we write -dt (A) for the appropriate reflection — accompanied 
by complex conjugation if t is an odd parity site — of A "into" the block Ab + Bt. In 




(3.12) 



(Q\e^ H \n) = [cos 2 («/2)e-^/ s +sin 2 (0/ 2 )e^/ s ] 

_ e -/3cose+||(l-cos 2 6)+0(/3 3 /5 2 ) 



25 



(3.13) 
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addition to the operators on Hj L = ® teT [C 25+1 ]t, we will also consider events A 
on the space of classical configurations ( t 5^2)' Tjj ' equipped with the Borel product a- 
algebra and the product surface measure dH = YireT L d^r- If A is an event that 
depends only on the configuration in As, we will call A a B-block event. For each t e 
Tl/Bj we use 9t {A) to denote the event in Ab + Bt that is obtained by (pure) reflection 
of .4 "into" /Is + Bt. 

Given a quantum Hamiltonian H of the form (3.1), let (—)l,0 denote the thermal 
state (2.21). Considering the classical Hamiltonian H°° : (^ 2 ) |Tl ' 1 -» R, which we 
define as 

H°°(Q) = lim (H) a = lim [H) n , (3.14) 

5 — *oo 5 — >oo 

we use to denote the usual Gibbs measure. Explictly, for any event A C (^2)' T , 

F LM = J A <W^ r , (3.15) 

where Zl((3) is the classical partition function. For each £>-block event .4 we will also 
consider its disseminated version nteT L/B ®t{A) and introduce the abbreviation 



teT L/B 



(B/L) d 



(3.16) 



for the corresponding quantity on the right-hand side of (2.24). An application of (2.23) 
shows that A t— > Pl,/?(^4) is an outer measure on the er-algebra of 5-block events 
(cf [6, Theorem 6.3]). 

For each measurable set A C (^2)' Tl ' we consider the operator 

25+1 x|T i | 



Qa = [~^~) J A dn \v)(n\- (3.17) 

Since the coherent states are overcomplete, this operator is not a projection; notwith- 
standing, we may think of it as a non-commutative counterpart of the indicator of the 
event A. In order to describe the behavior of under i?t> we introduce the classi- 
cal version § t of i?t which is defined as follows: Consider a "complex-conjugation" 
map a: («^2)' Tl ' — » (^2)' Ti ' which, in a given representation of the coherent states, 
has the effect 

\n)(n\ = \aQ){aQ\. (3.18) 

For the representation introduced in Sect. 2.1, we can choose a to be the reflection 
through the xz-plane (in spin space), i.e., if Q = (8, 4>) then <j{Q) = (8, — 0). For 
even parity t e T L / B , we simply have d t = 9 t while for odd parity t e T L / B we 
have i? t = 6 t o a. 

Here are some simple facts about the Q-operators: 
Proposition 3.4. For any B-block event A we have 



(3.19) 
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Moreover, if A\ , . . . , A m are B-block events and t\ , . . . , t m are distinct elements from 
Tl/b» then 

[Qe ti {A^QotM^ = °' 1 " 1 < 3 - m ' (3 ' 20) 

and 

Qe tl (Ai) ■ ■ ■ Qe tm (A m ) = Qe tl (A 1 )n-r\e tm {A m )- (3.21) 

Finally, Q of the full space (i.e., (^2)' T " L U is the unity, Q$ = 0, and if Ai,A2, ■ ■ ■ is a 
countable collection of disjoint events, then (in the strong-operator topology) 

oo 

SUSU^E&U- (^ 

n=l 

/n particular, Qj^ = 1 — Q a for any event A. 

Proof. The map # t is a pure reflection for even-parity t e T L / B and so (3.19) holds by 

the fact that pure reflection of Q_4 is Q of the reflected A. For odd-parity t, the relation 

(3.18) implies = Q a (A)> which yields (3.19) in these cases as well. The remaining 
identities are easy consequences of the definitions and (2.8). □ 

Remark 3.5. The last few properties listed in the lemma imply that the map A — » Qa 
is a positive-operator-valued (POV) measure, in the sense of [15]. As a consequence, 
if A C A' then Qa < Qa' while if {A n } is a countable collection of events, not 
necessarily disjoint, then 

oo 

%~ =1 A n <J2®^- (3-23) 

71=1 

Both of these properties are manifestly true by the definition (3.17). 

Before we state our next theorem, let us recall the "standard" setting for the applica- 
tion of chessboard estimates to proofs of phase transitions in classical models. Given B 
that divides L, one typically singles out a collection Q\, . . . , Q n of "good" _B-block 
events and defines 

B = (0i U • • • U G n y (3.24) 

to be the corresponding "bad" _B-block event. Without much loss of generality we will 
assume that B is invariant under "complex" reflections, i.e., i?t {&) — TBt (B), where r r 
denotes the shift by r on (^2)' Ti '- In the best of situations, carefully chosen good 
events typically satisfy the conditions in the following definition: 

Definition 3.6. We say that the "good" B-block events are incompatible if 

(1) they are mutually exclusive, i.e., Qi fl Q 3 ; = whenever i =/= j; 

(2) their simultaneous occurrence at neighboring blocks forces an intermediate block 
(which overlaps the two neighbors) i.e., there exists £ with 1 < £ < B such that 

0t{Qi) n MSj) C T Bt+e(t >-t)(B) (3.25) 



holds for all i ^ j and any t,t' £ T L / B with \t — t'\ = 1. Here T r is the shift by r. 
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These conditions are much easier to achieve in situations where we are allowed to 
use reflections through planes containing sites. Then, typically, one defines the C?j's so 
that the neighboring blocks cannot have distinct types of goodness. But as noted in 
Remark 2.3, we are not allowed to use these reflections in the quantum setting. Nev- 
ertheless, (1) and (2) taken together do ensure that a simultaneous occurrence of two 
distinct types of goodness necessarily enforces a "contour" of bad blocks. The weight 
of each such contour can be bounded by the quantity pL t p(B) to the number of consti- 
tuting blocks; it then remains to show that pL,p(B) is sufficiently small. For quantum 
models, appropriate modifications of this strategy yield the following result: 

Theorem 3.7. Consider a quantum spin system on with spin S and interaction for 
which the Gibbs state {—) L pfrom (2.21) is reflection positive for reflections through 
planes between sites on T^. Let H°° be a function and £ > a constant such that, for 
all L>1, 

sup \[ff] n -H°°(n)\ + sup \(H) a -H°°({2)\<Z\T L \. (3.26) 

fie(«y 2 )i T ii fie(^ 2 )i T -L 

Let Qi, . . . ,Q n be incompatible "good" B-block events and define B as in (3.24). Sup- 
pose that B is invariant under reflections and conjugation a, i.e., ~dt{B) = TBt(B) for 
all t e T L j B . Fix e > 0. Then there exists S > such that if f3 < c 2 VS and 

where c 2 and C3 are as in Theorem 3.1, we have 

(Qb) l ^ < e (3.28) 
and, for all i = 1, . . . ,n and all distinct ti,t 2 € T L / B , 

(^MS,) I 1 -Qo t2{gi )]) L ^<e. (3.29) 

Here S may depend on e and d, but not on (3, S, n nor on the details of the model. 

Remarks 3.8. Here are some notes concerning the previous theorem: 

(1) By general results (e.g., [17]) on the convergence of upper and lower symbols 
as S — > 00, the quantity £ in (3.26) can be made arbitrarily small by increas- 
ing S appropriately. In fact, for two-body interactions, £ is typically a small con- 
stant times 1 /S and so it provides a harmless correction to the term c 3 / >/S in 
(3.27). In particular, apart from the classical bound that p L,p{B) <§C 1, (3.27) will 
only require that j3 <C VS. 

(2) Note that the result is stated for pure reflections, 9t{Gi), of the good events, not 
their more complicated counterparts "dt{Qi)- This is important for maintaining a 
close link between the nature of phase transition in the quantum model and its clas- 
sical counterpart. We also note that H°° is not required to be reflection positive for 
Theorem 3.7 to hold. (Notwithstanding, the classical Hamiltonian will be reflection 
positive for all examples in Sect. 5.) 
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(3) The stipulation that the # t 's "act" on B only as translations is only mildly restric- 
tive: Indeed, u(B) = B in all cases treated in the present work. However, if it turns 
out that <j(B) 7^ B, the condition (3.27) may be replaced by 



which — since pL,p(cr(B)) < 1 — is anyway satisfied by a stricter version of (3.27) 
(this does need reflection positivity of H°°). Note that <r(B) = B implies that every 
configuration in a(Qi) is also good. In most circumstances we expect that a(Gi) is 
one of the good events. 



3.3. Phase transitions in quantum models. It remains to show how to adapt the main 
conclusion of Theorem 3.7 to the proof of phase transition in quantum systems. We 
first note that (3.27) is a condition on the classical model which, for 6 small, yields a 
classical variant of (3.29), 



Under proper conditions on e and the probabilities of the ft's, this yields absence of 
clustering for the classical torus Gibbs state which, by a conditioning "on the back of 
the torus" — see the paragraph before Lemma 4.5 — implies the existence of multiple 
infinite-volume Gibbs measures. 

For a quantum system with an internal symmetry, a similar argument allows us to 
deal with the cases when the symmetry has been "spontaneously" broken. For instance 
(see [25]) in magnetic systems (3.29) might imply the non-vanishing of the sponta- 
neous magnetization which, in turn, yields a discontinuity in some derivative of the free 
energy, i.e., a thermodynamic phase transition. In the cases with no symmetry — or in 
situations where the symmetry is not particularly useful, such as for temperature-driven 
phase transitions — we can still demonstrate a thermodynamic transition either by con- 
cocting an "unusual" external field (which couples to distinct types of good blocks) or 
by directly proving a jump e.g. in the energy density. 

An elegant route to these matters is via the formalism of infinite-volume KMS states 
(see, e.g., [30,46]). Let us recall the principal aspects of this theory: Consider the C* al- 
gebra 21 of quasilocal observables defined as the norm-closure of {J A c2, d wnere the 
union is over all finite subsets A and where 21 a is the set of all bounded operators on 
the Hilbert space Ha = &) re A\ ( ^ 2S+1 ~\ r ■ (T° interpret the union properly, we note that 
if A C A', then 21a is isomorphic to a subset of 21^', via the map A — > A ® 1 with 1 
being the identity in 21a'\a-) For each L > 1, let us identify T L with the block A L and 
let Hl be the Hamiltonian on Yl which we assume is of the form (3.1) with hp finite 
range and translation invariant. 

For each observable A £ %Ia l , let a[ L \A) = e ltHL Ae~ ltHh be the strongly-conti- 
nuous one-parameter family of operators representing the time evolution of A in the 
Heisenberg picture. For A local and Hl finite range, by expanding into a series of 
commutators 




(3.30) 



PL,/j(Mft)nMe?)) <c 



1 < i < n. 



(3.31) 




(3.32) 
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the map t ^> a t '(A) extends to all t £ C, see [30, Theorem III.3.6]. Moreover, 

the infinite series representation of a[ L \A) converges in norm, as L — > oo, to a one- 
parameter family of operators a t (A), uniformly in t on compact subsets of C. (These 
facts were originally proved in [43].) 

A state (— ) | gon2l — i.e., a linear functional obeying (A)p > if A > 0and(1)„ = 
1 — is called a KMS state (for the translation-invariant, finite-range interaction H at 
inverse temperature (3) if for all local operators A, B £ 21, the equality 

(AB)p = (ot-ip(B)A) p , (3.33) 

also known as the KMS condition, holds. This condition is the quantum counterpart 
of the DLR equation from classical statistical mechanics and a KMS state is thus the 
counterpart of the infinite-volume Gibbs measure. 

We proceed by stating two general propositions which will help us apply the results 
from previous sections to the proof of phase transitions. We begin with a statement 
which concerns phase transitions due to symmetry breaking: 

Proposition 3.9. Consider the quantum spin systems as in Theorem 3.7 and suppose 
that the incompatible good block events Qi, ■ ■ ■ ,Q n are such that (Qg k ) l,0 is the same 
for all k — 1, . . . , n. If (3.28-3.29) hold with an e such that (n + l)e < V2, then 
there exist n distinct, KMS states (—)p k \ k = l,...,n, which are invariant under 
translations by B and for which 

{Qg k )f > l-(n + l)e, fc = l,...,n. (3.34) 

The proposition says that there are at least n distinct equilibrium states. There may 
be more, but not less. This ensures a phase transition, via phase coexistence. 

Our second proposition deals with temperature driven transitions. The following is 
a quantum version of one of the principal theorems in [33, 34]: 

Proposition 3.10. Consider the quantum spin systems as in Theorem 3.7 and let Q\ 

and Q2 be two incompatible B-block events. Let j3\ < be two inverse temperatures 
and suppose that e e [0, V4) is such that for all L > 1, 

(1) the bounds (3.28-3.29) hold for all (3 £ [/?i,/3 2 ], 

(2) (0si)i,/3i > 1 - 2c and (Qg 2 ) L j h > 1 - 2c 

Then there exists an inverse temperature /3 t £ [/?i,/?2] and two distinct KMS states 

(-)ji and (— )p 2 ^ at inverse temperature (3 t which are invariant under translations 
by B and for which 

(Qe 1 )^ ) >l-4e and (Qg 2 )^ > 1 - 4e. (3.35) 

The underlying idea of the latter proposition is the existence of a forbidden gap in the 
density of, say, C/i-blocks. Such "forbidden gap" arguments have been invoked in (lim- 
iting) toroidal states by, e.g., [29, 33, 34]; an extension to infinite-volume, translation- 
invariant, reflection-positive Gibbs states has appeared in [8]. Both propositions are 
proved in Sect. 4.3. 
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4. Proofs 

Here we provide the proofs of our general results from Sect. 3. We begin by the es- 
timates of matrix elements of Gibbs-Boltzmann weight (Theorem 3.1) and then, in 
Sect. 4.2, proceed to apply these in quasiclassical Peierls' arguments which lie at the 
core of Theorem 3.7. Finally, in Sect. 4.3, we elevate the conclusions of Theorem 3.7 
to coexistence of multiple KMS states, thus proving Propositions 3.9-3.10. 



4.1. Bounds on matrix elements. The proof of Theorem 3.1 is based on a continuity 
argument whose principal estimate is encapsulated into the following claim: 

Proposition 4.1. Suppose that (3.7-3.9) hold with constants R, Co, and c\. Let Hq = 

H — [H]i2. Suppose there exist c 2 > and e > such that for all j3 < C2VS, 

K^le-^l^')! < e-^^'^+^l (4.1) 

is true for all fi, fi' € (J^)'^ 1 '- Then there exists a constant c 3 depending on c , C\, c 2 
and R (but not A, S or e) such that for all (3 < c 2 VS, 

A(^| e -/3-&«|/2') <_^|4e--»M«.«')+*W (4.2) 
dp vo 

Before we commence with the proof, we will make a simple observation: 
Lemma 4.2. For all A and all Q, fi' , fi" G (J^)' 71 ', 

d s ([2,fi')< d s (n',n") + ^s\\£t-n"\\ 1 + J2 1 Wr*n« } - (4.3) 

reA 

Proof. Since all "norms" in the formula are sums over r € A, it suffices to prove 
the above for A having only one point. This is easy: For fi = fi" the inequality is 
actually an equality. Otherwise, we apply the bounds ds(fi, fi') < VS\fi — fi'\ and 
d s (fi' , fi") + I > VS\fi' - fi"\ to convert the statement into the triangle inequality 
for the I 1 -norm. □ 

Proof of Proposition 4.1. Let us fix fi and fi' for the duration of this proof and abbre- 
viate M(f3) = {Q\e~P Hn \fi'). We begin by expressing the derivative of M(f3) as an 
integral over coherent states. Indeed, M'(/3) = —([2\Hq e~P Hn \f2') and so inserting 
the upper-symbol representation (2.16) for Hq = J2rcA(hr — [hr]n), we have 

M'((i) = -J2 (^r 1 )^ I d ^" (om{fy'\^\n%h r } n ,,-[h r ]n). 

(4.4) 

By the fact that [hr]^„ — [hr]n depends only on the portion of fi" on r, the integrals 
over the components of fi" outside r can be carried out which yields 

M'(P) = -J2 f^r^V '/ dQ'f{Q r \Q , f){Q ,, \ e -^\n')([h r }^-[h r }n). 

(4.5) 
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Here, as for the rest of this proof, Q" is set to Q outside r and to Qp in F. 

LetXr denote the integral on the right-hand side of (4.5). Using (3.6), (4.1) and (3.9) 
we have 

\Ir\ < CiH/irlle^l 71 ! / d ^ e -vds(n'^')- v sU2''-f2f 2 -f3([H] S! ,,-[H]n)^n_ n ^^ 

(Recall from the definition that Hq = Hq» — [H] q + [H] a" .) In order to bound the 
right-hand side, we need a few simple estimates. First, noting that 

[H\n» - [H]n = ^ - [M«) , (4-7) 

r'-.r'nr^ 

(3.8) and (3.9) imply that, for some constant C4 depending only on cq, c\ and R, 

\[H\n» - [H)n\ < c 4 \\f2 - = c 4 \\f2'f - Q r \\i- (4.8) 
Second, Lemma 4.2 tells us 

-d s {0',f2") < -d s {0,n') + y/S\\f2 r - ^'fh + \ r \- ( 4 -9) 

Finally, \\f2" — is bounded by times the exponential of v!S \\{2 — 

Since we are assuming that (3 < c 2 VS, we conclude that 

& -ni s (n' ,f2")-0([H] nll -[H\o)\\Qii _ q\\ < e?7|r| c -r,d s (n,n')+c 5 Vs\\n r -n'M\i 

Vs 

(4.10) 

for some constant C5 independent of S and A. 
Plugging this back in the integral (4.6), we get 

|/j r || c 0e\A\-nd s (a,a') f e c b ^\\n r -nz\\- l - n s\\n r -n'{.\\l _ 

(4.11) 

To estimate the integral, we note that both norms in the exponent are sums over individ- 
ual components. Hence, the integral is bounded by the product of \r\ integrals of the 
form 

K= I dr" e c 5V / 5k-T'"|-r;5|r-r"| 2 ^ (4.12) 

where r and r" are vectors on S^i — representing the corresponding 3-dimensional 
components of flp and Q'f — and where \r — r"\ denotes Euclidean distance in R 3 . 
Parametrizing by r = \r —r"\ and integrating over the polar angle of r" relative tor, 
we now get 

r 2 

K= dr J(r)e-^ Sr2+C5 ^ r . (4.13) 
Jo 

Here the Jacobian, ^f(r), is the circumference of the circle {r" : \r"\ = 1, \r — r"\ = 
r}. But this circle has radius smaller than r and so J> (r) < 2nr. Scaling r by S~ x / 2 
yields K < c 6 /S for some constant c 6 > independent of S. 
Plugging this back in (4.1 1), we then get 

\Ir\ < -2L(^)' r Vr||e-" d -^^)+^^. (4.14) 
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Inserting this into (4.5), using (3.7) to bound the terms exponential in \r\ by a con- 
stant depending only on R — this is possible because there are \T\ factors of S's in the 
denominator of (4.14) that can be used to cancel the factors (25 + 1) in front of the 
integral in (4.5) — and applying (3.8), we get (4.2). □ 

On the basis of Proposition 4.1, the proof of Theorem 3 . 1 is easily concluded: 
Proof of Theorem 3.1. Let c 2 and C3 be the constants from Proposition 4.1 and let e = 
c 3 /VS. We claim that (4.1) holds for all (3 < c 2 y/S. First, in light of (3.6) and the 
definition of d s (S2, Q'), (4.1) holds for j3 = 0. This allows us to define j3 to be the 
largest number such that (4.1) holds for all (3 e [0, /3b]. Now, if (3 < /3o A c 2 \/~S, then 

Proposition 4. 1 and our choice of e guarantee that the /3-derivative of {S2\e~^ Hn \ Q') is 
no larger than that of the right-hand side of (4.2). We deduce (by continuity) that /3rj = 
c 2 VS. Using that Hq = H — [H]n, we now get (3.10). □ 

Proof of Corollary 3.2. First we observe that the diagonal matrix element (ST\e~@ H \ ST) 
is real and positive. The upper bound is then the SI' — S2 version of Theorem 3.1; 
the lower bound is a simple consequence of Jensen's — also known as the Peierls- 
Bogoliubov — inequality; see, e.g., [46, Theorem 1.4.1]. □ 



4.2. Quasiclassical Peierls' arguments. Our goal is to prove the bounds (3.28-3.29). 
To this end, let us introduce the quantum version of the quantity from (3.16): For any B- 
block event A, let 

/ V (B/L) d 

\teT L/B I L J3 

(Note that, by (3.19), this is of the form of the expectation on the right hand side 
of (2.23).) First we will note the following simple consequence of Theorem 3.1: 

Lemma 4.3. Let £ be as in (3.26) and let c 2 and C3 be as in Theorem 3.1. If (3 < c 2 VS, 
then for any B-block event A, 

q L AA) < [PlAA)Pl,pHA))} 1/2 e^^). (4.16) 
Proof. By (3.21) we have 

<\ljM) = (Qj) { L B / 3 L)d where -4= n (4 - i?) 

teT L/B 

Invoking the integral representation (3.17), the bounds from Corollary 3.2 and the defi- 
nition of £ from (3.26), 

<\lAA) < 7? L AM B/L)d z m+C3 '^ ] - (4-18) 

Now we may use (2.24) for the classical probability and we get (4.16). □ 

Next we will invoke the strategy of [25] to write a bound on the correlator in 
(3.29) in terms of a sum over Peierls contours. Let denote the set of connected 

sets Y C T^ B with connected complement. By a contour we then mean the bound- 
ary of a set Y S i.e., the set <9Y of nearest neighbor edges on T L / B with one 
endpoint in Y and the other endpoint in Y c C T L / B . The desired bound is as follows: 
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Lemma 4.4. Let Gi, . . . ,Q n be incompatible good events and let B be the bad event 
with the property that TBt(B) = $t(<B) f or a H * <= "^l/b- Then for all distinct ti,t 2 G 
T L / B and all i = 1 , . . . , n, 

Qe tASi) Qe t2 W)) L < E 2[4q L ,M£)]^ |9¥| . (4.19) 

' P Y: Y£^ L/B 
ti£Y,t 2 £Y 

Proof. We begin by noting that t\ ^ t 2 and (3.20-3.21) give us 

Qo tl (o<)Qo*2W) = (—n-) / dn\a)(a\. (4.20) 

Now pick f2 G Ot 1 (Gi)^dt 2 (Gi) and let Y' C T^/b be the largest connected component 
of 5-blocks — i.e., translates of As by Bt, with t G T L / B — such that t\ G Y' and 
that 9t(Qi) occurs for every t G ¥'. This set may not have connected complement, so 
we define Y G ^l/b t° be the set obtained by filling the "holes" of Y', except that 
which contains t 2 - Note that all translates of Ab corresponding to the boundary sites 
of Y are of type Qi. 

In order to extract the weight of the contour, we will have to introduce some more 
notation. Decomposing the set of boundary edges <9Y into d sets d\ Y, . . . , <9<jY accord- 
ing to the coordinate directions into which the edges are pointing, let j be a direction 
where |<9jY| is maximal. Furthermore, let Y™ 1 be the set of sites in Y c which are on 
the "left" side of an edge in djY. It is easy to see that this singles out exactly half of 
the sites in Y c that are at the endpoint of an edge in djY. Next we intend to show that 
the above setting implies the existence of at least |YJ"|/2 bad blocks whose position is 
more or less determined by Y. 

Recall that Bj denotes the unit vector in the j-th coordinate direction. Since the 
good events satisfy the incompatibility condition (3.25), at least one of the following 
two possibilities must occur: either fl G r B t (£>) for at least half of t G YJ" or J? G 
TBt+ihj (B) for at least half of t G YJ". (Here I is the constant from the definition 
of incompatibility.) Indeed, if the former does not occur then more than half of t G 
Y^ xt mark a good block, but of a different type of goodness than Gi- Since this block 
neighbors on a ft-block, incompatibility of good block events implies that a bad block 
must occur £ lattice units along the line between these blocks. 

Let us temporarily abbreviate Kj = |YJ"| and let ^j(Y) be the set of collections 
of Kj/2 sites representing the positions of the aforementioned Kj/2 bad blocks. In 
light of TBt(B) = -dtiB), the above argument implies 

/Kj/2 K 3 /2 \ 

e tl (Gi) n e t2 (<j?) c (J (J if] (<? ti (fi)) u f| m 3 (tf ti (B)) . 

Y:Ye^ /B (ti)e^j(Y) \ i=l i=l / 

ii£Y,t 2 0Y 

(4.21) 

Therefore, using the fact that A t— » Qa is a POV measure (cf Remark 3.5), this implies 

/Kj/2 ^ K 3 /2 ^ 

Qe tl ( Si )Qe t2 (gt) < E E II + II Q Ttt .(**.(B)) 

Y:Ye^ L/ B (t«)e^j(Y) \ i=l »=i n ' 

ti£Y,t 2 0Y 

(4.22) 
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Here the two terms account for the two choices of where the bad events can occur and j 
is the direction with maximal projection of the boundary of Y as defined above. Since 
(2.23), (3.19) and 9(B) = B allow us to conclude that 

iKj/2 y 

( II <*LAB) Ki/2 > ( 4 - 2 3) 

\ »=1 / L,f3 

and since the translation invariance of the torus state (—) L p implies a similar bound is 
also valid for the second product, the expectation of each term in the sum in (4.22) is 
bounded by 2q Ltf3 (B) K ^ 2 . The sum over (U) £ ^(Y) can then be estimated at 2 K i 
which yields 

{Qe tl ( Si) Qo t2 ( Sd ) L < E 2 [4qL,M£)]' ¥ri/2 - (4-24) 

,P Y: Y6^ L/B 
ii£Y,t 2 0Y 

From here the claim follows by noting that our choice of j implies | Y^ xt | > ^|<9Y| (we 
assume that &(\l^(B) < 1 without loss of generality). □ 

Proof of Theorem 3.7. By Lemma 4.3, the assumptions on B, and (3.27) we have that 
(\l,p(B) < S. Invoking a standard Peierls argument in toroidal geometry — see, e.g., the 
proof of [6, Lemma 3.2] — the right-hand side of (4.19) is bounded by a quantity r](5) 
such that rj(S) J. as S J. 0. Choosing S sufficiently small, we will thus have r)(5) < e, 
proving (3.29). The bound (3.28) is a consequence of the chessboard estimates which 

yield (Q b )l,p < <\lA b ) < s - D 



4.3. Exhibiting phase coexistence. In order to complete our general results, we still 
need to prove Propositions 3.9 and 3.10 whose main point is to guarantee existence 
of multiple translation-invariant KMS states. (Recall that, throughout this section, we 
work only with translation-invariant interactions.) Let us refer to 

T+ = {x e T L : - LV 4 - V 2 J < x x < p/ 4 - V2I } (4.25) 

as the "front side" of the torus, and to T£ as the "back side." Let 2lJ be the C* algebra 
of all observables localized in (i.e., an operator in 21 £ acts as the identity on T£). 

The construction of infinite-volume KMS states will be based on the following stan- 
dard lemma: 

Lemma 4.5. Let T L / B be the factor torus and let Am C T L / B be a block of M x • • • x 
M sites at the "back side" ofT L / B (i.e., we have dist(0, Am) > — M). Given a 
B-block event C, let 

Pl,m{C) = j^-r E Qo t {C)- ( 4 '26) 

Suppose that (Qc)l,[3 > c for all L 3> 1 and some constant c > 0, and define the 
"conditional" state {—) L M -p 011 by 

( A ) - (plMC)A) l ,p ( , 21) 

\ / L,M:8 /' (r>\\ • V*-LI) 
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If{—)p is a (subsequential) weak limit of {—) L M ^ as L — > oo (along multiples of B) 
followed by M — > oo, then (— ) /3 is a KMS state at inverse temperature (3 which is 
invariant under translations by B. 

Proof. Translation invariance is a consequence of "conditioning" on the spatially-ave- 
raged quantity (4.26). Thus, all we need to do is to prove that the limit state satisfies 
the KMS condition (3.33). Let t <— > a[ L ^ be the unitary evolution on Tl. If B is a local 
observable that depends only on the "front" side of the torus the fact that the interaction 
is finite range and that the series (3.32) converges in norm, uniformly in L, implies 

[a[ L \B),p LM {C)\ — (4.28) 

in norm topology, uniformly in t on compact subsets of C. (Note that, for any B lo- 
calized inside a fixed finite subset of Z d , for large enough L, it will always be in the 
"front" side T£, under the projection 7L d — > = Z d /LZ d .) This means that for any 
bounded local operators A and B on the "front" side of the torus, 

(p L . M (C)AB) L fj = (p LiM (C)a%(B)A) +o(l), L - oo. (4.29) 

(Again, it is no restriction to say that A and B are on the "front" side, by simply letting 
L be large enough.) Since a^ fj (B) — > ct-ip(B) in norm, the state A i — > (^4) l,M;/3 
converges, as L — > oo and M — > oo, to a KMS state at inverse temperature /?. □ 

Proof of Proposition 3.9. By + + • • • + Qg n = 1, the symmetry assumption 
and (3.28) we know that 

(Qg k ) L ^>^. (4.30) 

So, if PL,M{Gk) is as in (4.26), the expectation (pL,Ai(Gk))L,p is uniformly positive. 
This means that, for each k = 1, . . . , n, we can define the state {—)^m-^ ^ = 1, . . . , n, 
by (4.27) with the choice C = Qk- Using (3.29) we conclude 

(QW>)) ( £ M . P >1-^ k = l,...,n, (4.31) 

for any t on the "front" side of T L / B (provided that M <C l Ib). For (n + l)e < V2, the 

right-hand side exceeds V2 and so any thermodynamic limit of (-)^M-a as L ^ 00 
and M — ► 00 is "domintated" by ^-blocks. Since, by Lemma 4.5, any such limit is a 
KMS state, we have n distinct states satisfying, as is easy to check, (3.34). □ 

Proof of Proposition 3.10. Consider the states (— )^ M . (3 and (— }^ M . /3 defined by 
(4.27) with C = Q\ and C = Q2, respectively. From assumption (1) we know that 
flfc := (pL,M(Gk)} > for at least one k = 1, 2 and so, for each /3 e [/3 X , /3 2 ], at least 
one of these states is well defined. We claim that we cannot have (Q^) l,M;P < 1 ^ 4e 
for both k = 1, 2. Indeed, if that were the case then 



Plm{Qi) + Pl,m(Gz) + Pl,m{&) = 1 



(4.32) 
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and the bounds (3.28-3.29) would yield 

ai + a 2 = (Qg 1 + Qg 2 ) L ^ 

+ ( PlMSi) Qg 2 ) L ^ + < PL.M^) Q Sl ) L}0 < 4 - 33 ) 

+ (pl,m(B) [1-Qb]) Li(I 
< (l-4e)(ai+a 2 ) + 3e 

i.e., 4(ai + a 2 ) < 3. Since e < 1 U this implies a\ + a 2 < 3 /4 < 1 — e, in contradiction 
with assumption (1). 

Hence, we conclude that the larger from {Qg k )^M-p> k = 1,2 (among those states 
that exist) must be at least 1 — 4e. The same will be true about any thermodynamic limit 
of these states. Let S k C [/?i,/3 2 ], fc = 1, 2, be the set of j3 e [Pi, /3 2 ] for which there 
exists an infinite-volume, translation-invariant KMS state (— ) /3 such that (Qg k )p > 
1 — 4e. Then Si U H 2 = [/3i, /3 2 ]. Now, any (weak) limit of KMS states for inverse tem- 
peratures j3 n — > (3 is a KMS state at /3, and so both E\ and S 2 are closed. Since [/3\ , /3 2 ] 
is closed and connected, to demonstrate a point in Si n S 2 it suffices to show that 
both Si and S 2 are non-empty. For that we will invoke condition (2) of the proposi- 
tion: From (Qg^L,^! > 1 — 2e we deduce 

(Q0^ M = 1 - (Qs> + Qb) ( ^ m , 01 > 1 - ^ > 1 - 4e, (4.34) 

/ 2 \ 

and similarly for (Qg 2 ) l,M;/3 2 - Thus /3i G ^i and /3 2 e £2, i.e., both sets are non- 
empty and so £"i n £ 2 7^ as claimed. □ 

5. Applications 

Here we will discuss — with varying level of detail — the five quantum models described 
in the introduction. We begin by listing the various conditions of our main theorems 
which can be verified without much regard for the particulars of each model. Then, 
in Sect. 5.2, we proceed to discuss model (1) which serves as a prototype system for 
the application of our technique. Sects. 5.3-5.5 are devoted to the details specific for 
models (2-5). 



5.7. General considerations. Our strategy is as follows: For each model we will need 
to apply one of the two propositions from Sect. 3.3, depending on whether we are deal- 
ing with a "symmetry -breaking" transition (Proposition 3.9) or a temperature-driven 
energy-entropy transition (Proposition 3.10). The main input we need for this are the 
inequalities (3.28-3.29). These will, in turn, be supplied by Theorem 3.7, provided 
we can check the condition (3.27). Invoking Theorem 3.1, which requires that our 
model satisfies the mild requirements (3.7-3.9), condition (3.27) boils down to show- 
ing that pL,p{B) is small for the requisite bad event. It is, for the most part, only the 
latter that needs to be verified on a model-specific basis; the rest can be done in some 
generality. 
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We begin by checking the most stringent of our conditions: reflection positivity. 
Here, as alluded to in Remark 2.5, we are facing the problem that reflection positiv- 
ity may be available only in a particular representation of the model — which is often 
distinct from that in which the model is a priori defined. The "correct" representation 
is achieved by a unitary operation that, in all cases at hand, is a "product rotation" of 
all spins. 

There are two rotations we will need to consider; we will express these by means 
of unitary operators Ua and {/ B . Consider the Hilbert space Ht l = ® re j [C 2S+1 } r 
and let (S£ , S% , ) have the usual form — cf (2. 1) — on Ht l ■ In this representation, the 
action of Ua on a state \ip) £ Ht l is defined by 

[/ A |V} = J] e^e^lV)- (5.1) 

The effect of conjugating by this transformation is the cyclic permutation of the spin 
components — ► S* — > S* — > S%. The second unitary, Ub, is defined as follows: 

w>= n e " s "i^)- (5-2) 

reT L 

odd-parity 

The effect of Ub on spin operators is as follows: For even-parity r, the spin operators 
are as before. For odd-parity r, the component S% remains the same, while both S* 
and pick up a minus sign. Here are the precise conditions under which our models 
are reflection positive (RP): 

Lemma 5.1. Let Ua and Ub be the unitary transformations defined above. Then: 

(a) UaHU^ 1 is RP for models (4-5), and for model (2) with 1 ? (x) = y 1 (x 2 )+x1' 2 (x 2 ). 

(b) UbHU^ 1 is RP for models (1,3). 

(c) UbUaHU^U^ 1 is RP for model (2) with ?(x) = ^(x 2 ) - x9 2 {x 2 ). 

Proof, (a) Under the unitary Ua map, the Hamiltonians of models (4-5) are only using 
the x and z-components of the spins, which are both real valued. The resulting interac- 
tion couples nearest-neighbor spins ferromagnetically, and thus conforms to (2.20). 

(b) For two-body, nearest-neighbor interactions, Ub has the effect 

S r S r i ► S r S r i , ol — x , z , (5.3) 
while the S}I.S v rl terms remain unchanged. Writing 

SlSl = -(iS*)(iS« ) (5.4) 

we can thus change the sign of all quadratic terms in the interaction and, at the same 
time, express all operators by means of real- valued matrices. Under the conditions given 
in Sect. 1, the Hamiltonians in (1.1) and (1.3) are then of the desired form (2.20). 

(c) Finally, for model (2), we first apply the argument in (a). Then the effect of Ub 
is that the minus sign in JP(x) = 5'i(a; 2 ) — x r p2{x 2 ) becomes a plus sign. □ 

Our next items of general interest are the "easy" conditions of Theorem 3.1 and 
Theorem 3.7. These turn out to be quite simple to check: 
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Lemma 5.2. The transformed versions — as defined in Lemma 5.1 — of the five models 
from Sect. 1 satisfy the conditions (3.7-3.9) with some finite R and some c\ independent 
of S. Moreover, for each of the models (1-6) there exists a constant C such that (3.26) 
holds with £ = C/S for all S. 

Proof. All interactions involve at most two spins so R = 2 suffices to have (3.7). Writ- 
ing the interaction in the form (3.1), the normalization by powers of S makes the corre- 
sponding norms \\hr\\ bounded by a quantity independent of S. This means that (3.8) 
holds in any finite set (including the torus, with proper periodic extension of the fir's). 
As to the Lipschitz bound (3.9), this is the subject of Theorem 2 and Proposition 3 
of [17]. Since S~ 1 [S"]n ~ fi r + 0(1/ S), and similarly for the lower symbol, the 
same argument proves that £ = 0(1 /S). □ 

To summarize our general observations, in order to apply Propositions 3.9-3.10, we 
only need to check the following three conditions: 

(1) The requisite bad event is such that ■dt(B) = B for all t £ T L i B . 

(2) The occurrence of different types of goodness at neighboring £>-blocks implies that 
a block placed in between the two (so that it contains the sites on the boundaries 
between them) is bad — cf condition (2) of Definition 3.6. 

(3) The quantity pL^(B) is sufficiently small. 

In all examples considered in this paper, conditions (1-2) will be checked directly but 
condition (3) will require estimates specific for the model at hand. (Note that, since we 
are forced to work in the representation that makes the interaction reflection positive; 
the conditions (1-3) must be verified in this representation.) 

Remark 5.3. It is noted that all of the relevant classical models — regardless of the signs 
of the interactions — are RP with respect to reflections in planes of sites. We will often 
use this fact to "preprocess" the event underlying pL,p{B) by invoking chessboard es- 
timates with respect to these reflections. We will also repeatedly use the subadditivity 
property of A i— > pL,p(-A) as stated in [6, Theorem 6.3]. Both of these facts will be 
used without (much) apology. 



5.2. Anisotropic Heisenberg antiferromagnet. Consider the reflection-positive version 
of the Hamiltonian (1.1) which (in the standard representation of the spin operators) on 
the torus Tl takes the form 



(The classical version of Hl is obtained by replacing each S 1 ^ by the corresponding 
component of SSl r .) The good block events will be defined on a 2 x • • • x 2 block A b — 
i.e., B = 2 — and, roughly speaking, they will represent the two ferromagnetic states in 
the z-direction one can put on Ab- Explicitly, let Q + be the event that fi r = (9 r , <f) r ) 
satisfies \9 r \ < n for al r e Ab and let Q- be the event that \0 r — 7r| < k for 
all r e A B - 

Theorem 5.4 (Heisenberg antiferromagnet). Let d > 2 and let < J\ , J2 < 1 be 

fixed. For each e > and each n > 0, there exist constants c and (5q and, for all [3 and S 




(5.5) 
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with Po < P < c\/~S, there exist two distinct, translation-invariant KMS states (— )^ 
and (— )p with the property 

(Qg ± )p>l-e- (5.6) 
In particular, for all such (3 we have 

(S z o)^ (S z o r p > 0. (5.7) 

Proof. Let B = (G+ U G-) c be the bad event. It is easy to check that i?t a °ts on ,6 only 
via translations. Moreover, if Q + and occur at neighboring (but disjoint) translates 
of Ab, then the block between these is necessarily bad. In light of our general obser- 
vations from Sect. 5.1, we thus only need to produce good bounds on p L,p{B)> the 
classical probability of bad behavior. Since these arguments are standard and appear, 
for all intents and purposes, in the union of Refs. [11,23,24,44], we will be succinct 
(and not particularly efficient). 

LetZ\ = min{(l-Ji), (1- J 2 ), 2/a d } where a d = d2 d ~ 1 and fix rj > with 77 < 1 
such that 

1- cost] -A sin 2 k < 0. (5.8) 

We will start with a lower estimate on the full partition function. For that we will restrict 
attention to configurations where \6 r \ < vfa for all r G Tj,. The interaction energy of 
a pair of spins is clearly maximized when both the x and y-terms are negative. This 
allows us to bound the energy by that in the isotropic case J\ = J 2 = 1 — i.e., the 
cosine of the angle between the spins. Hence, the energy between each neighboring 
pair is at most (— cos ij). We arrive at 

Z L {fi) > [V{ n )^ cosr >] L \ (5.9) 

where the phase volume V{rj) — 2-k[\ — cos^h)] may be small but is anyway indepen- 
dent of (3. 

To estimate the constrained partition function in the numerator of p{B), we will 
classify the bad blocks into two distinct categories: First there will be blocks where not 
all spins are within k of the pole and, second, there will be those bad blocks which, 
notwithstanding their Ising nature, will have defects in their ferromagnetic pattern. We 
denote the respective events by B\ and 82- To bound pL,p{Bi), since we may decorate 
the torus from a single site, we may as well run a single site argument 2 d -times. We 
are led to consider the constrained partition function where every site is outside its 
respective polar cap. It is not hard to see that the maximal possible interaction is 1 — 
A sin 2 k\ we may estimate the measure of such configurations as full. Thus, 

Pl,p(Bi) ^ 2<i ^-e^ 1 - cosri - Asin2 K l (5.10) 

Note that, by (5.8), this is small when (3 > 1. 

The less interesting Ising violations are estimated as follows: The presence of such 
violations implies the existence of a bond with nearly antialigned spins. We estimate 
the interaction of this bond at cos(2k). Now there are ad bonds on any cube so when 
we disseminate — using reflections through sites — we end up with at least one out of 
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every bonds with this energy. The rest we may as well assume are fully "aligned" — 
and have energy at least negative one — and we might as well throw in full measure, for 
good measure. We thus arrive at 

Pl.AB 2 ) < fld-^y exp|/3rf(^-cos(2K) + 1 - ^- - cost?) j (5.11) 

as our estimate for each such contribution to the Ising badness. Here the prefactor ad 
accounts for the choice of the "bad" bond. Since 1/aa > A / 2 , the constant multiply- 
ing [3d in the exponent is less than the left-hand side of (5.8); hence p L,f}{B 2 ) <C 1 
once (3 2> 1 as well. It follows that, given J\, J 2 < 1, we can find /3o sufficiently large 
so that pL,p{B) < pL,p(B\) + pL.,p{B 2 ) < 1 once /3 > (3 . The statement of the 
theorem is now implied by Proposition 3.9 and the ±-symmetry of the model. □ 



5.3. Large-entropy models. Here we will state and prove order-disorder transitions in 
models (2-3). As in the previous subsection, most of our analysis is classical. While 
we note that much of the material of this section has appeared in some form before, 
e.g., in [11,16, 21, 22, 33, 44], here we must go a slightly harder route dictated by the 
quantum versions of reflection positivity. 

We start with the observation that model (2) with 7{x) = Vi(x 2 ) - x7 2 (x 2 ) is 
unitarily equivalent, via a rotation of all spins about the z-axis, to the same model 
with y{x) = ?! (x 2 ) + xT 2 (a; 2 ) - Hence, it suffices to consider only the case of the plus 
sign. We thus focus our attention on models with classical Hamiltonians of the form 

p 

H°°(Q) = - ^2 ^c k {Q r of2 r ,) k , c fe >0, (5.12) 

{r,r') k=l 

where o Q 2 ) denotes the variant of the usual dot product — + 

(z) (z) 

Q\ Q\ for model (3), and the "dot product among the first two components" for 
model (2). We now state our assumptions which ensure that models (2) and (3) have the 
large entropy property. 

Let us regard the coefficients in (5.12) as an infinite (but summable) sequence, 
generally thought of as terminating when k = p. (For the most part we will require 
that (£ p be a polynomial. However, some of our classical calculations apply even for 
genuine power series.) The terms of this sequence may depend on p so we will write 

them as = (c^ p \ c 2 P \ . . . ); we assume that the € 1 -norm of each is one. Let 
£ p : [-1,1] -» R be defined by 

fe>i 

Here is the precise form of the large-entropy property: 

Definition 5.5. We say that the sequence (c^) has the large entropy property if there 
is a sequence (e p ) of positive numbers with e p I such that the functions 

A p (s) = <S P (1 - e p s) (5.14) 

converge — uniformly on compact subsets of [0, 00) — to a function s ^ A(s) with 

lim A(s) = 1 and lim A(s) = (5.15) 
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Remark 5.6. Despite the abstract formulation, the above framework amalgamates all 
known examples [21,22] and provides plenty of additional generality. A prototyp- 
ical example that satisfies Definition 5.5 is the sequence arising as the coefficients 
of the polynomial <£ p (x) — A general class of sequences c^ p ' is defined 

from a probability density function 0: [0, 1] — > [0, oo) via cj^ = ^<f>{ k / P )- In these 
cases we can generically take e p — 1 / p and the limiting function A is then given 
by A(s) = Jp 1 (f>(\)e.~ Xs d\. However, as the example <B p (x) = {Hp-) p shows, ex- 
istence of such a density function is definitely not a requirement for the large-entropy 

property to hold. What is required is that the "distribution function" J2k< P s c fc^ * s sma U 
for s < 1. 

Our analysis begins with the definition of good and bad events. First we will dis- 
cuss the situation on bonds: The bond (r ,r') is considered to be energetically good if 
the attractive energy is larger (in magnitude) than some strictly positive constant b (a 
number of order unity depending on gross details, where we recall that 1 is the optimal 
value), i.e., if 

£ p (f2 r oQ r ,) > b. (5.16) 

The entropically good bonds are simply the complementary events (so that every bond 
is a good bond). Crucial to the analysis is the fact, ensured by our large entropy assump- 
tion, that the crossover between the energetic and entropic phenotypes occurs when the 
deviation between neighboring spins is of the order y/e^. 

We define the good block events Gord and C/dis on the 2 x • • • x 2-block As as 
follows: C/ or d is the set of spin configurations where every bond on Ab is energetically 
good while Q&& collects all spin configurations where every bond on Ab is entropically 
good. The requisite bad event is defined as B — (Gord U GdisY- 

Our fundamental result will be a proof that the density of energetically good blocks 
is discontinuous: 

Theorem 5.7 (Large-entropy models). Consider a family of finite sequences = 

(c^)fc<p and suppose that <£ p have the large entropy property in the sense of Defini- 
tion 5.5. Consider the quantum spin systems with the Hamiltonian 

H(P) = - J2 £ P (S- 2 (S r oS r >)), (5.17) 

(r,r') 

(with both interpretations of (S r o S r >) possible). Then there exists b G (0,1) for 
which the associated energetic bonds have discontinuous density in the large S quantum 
systems. Specifically, for every e > there is apo < oo so that for any p > po and all S 
sufficiently large, there is an inverse temperature (3t at which there exist two distinct, 
translation-invariant KMS states and {— )* s with the property 

(QgJt ^ 1 - e and (QsJt > 1 - e - (5-18) 

With a few small additional ingredients, we show that the above implies that the 
energy density itself is discontinuous: 
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Corollary 5.8. There exist constants b and b', both strictly less than 1 /% such that the 
energy density e(/3) — defined via the fi-derivative of the free energy — satisfies 

t(P){- (5.19) 
\<b, ifP<Pu 

for all p sufficiently large. 

The bulk of the proof of this theorem again boils down to the estimate of pL.p (£>): 

Proposition 5.9. There exist b G (0, 1), A > 0, C < oo, and for each b e (0, b ] there 
exists po < oo such that 

lim p L ,p(B) < C(e p ) A (5.20) 

L^oo 

hold for all p > p and all (3 > 0. 

Apart from a bound on pL,p(B), we will also need to provide the estimates in con- 
dition (2) of Proposition 3.10. Again we state these in their classical form: 

Proposition 5.10. There exist constants C\ < oo, p\ < oo and A\ > such that the 
following is true for all p>p\: First, at (3 = we have 

limsupp L ,o(g o rd) < C^ep)^. (5.21) 

L^oo 

Second, if (3q <G (0, oo) is large enough, specifically ife l3ad > ep 2 ( 1+Al \ then 

lim sup pL^ftiis) < Ci(e p ) 41 . (5.22) 

L — >oo 

The proof of these propositions is somewhat technical; we refer the details to the 
Appendix, where we will also prove the corollary. 

Proof of Theorem 5. 7. We begin by verifying the three properties listed at the end of 
Sect. 5.1. As is immediate from the definitions, neighboring blocks of distinct type of 
goodness must be separated by a bad block. Similarly, reflections 8 t act on B only as 
translations. To see that the same applies to the "complex" reflections we have to 
check that B is invariant under the "complex conjugation" map a. For that it suffices to 
verify that a(f2) o a {ft') = Q o fl' for any S7, Q' E y 2 - This follows because both 
interpretations of fl o fl are quadratic in the components of fl and because a changes 
the sign of the y-component and leaves the other components intact. 

Let b < bo where bo is as in Proposition 5.9. Then (5.20) implies that pL,p(B) <SC 1 
once € p <C 1. Quantum chessboard estimates yield (Qa)l./3 < ^L,p{A) which by 
means of Theorem 3.1 implies that both (Qg dis )L,o an d {Qq^) L,p we close to one 
once L ^> 1 and ^/S is sufficiently large compared with (3o (referring to Proposi- 
tion 5.10). Theorem 3.7 then provides the remaining conditions required for application 
of Proposition 3.10; we conclude that there exists a /?, e [0,/3 ] and two translation- 
invariant KMS states (-)° rf and (-)* s such that (5.18) hold. □ 

Remarks 5.11. Again, a few remarks are in order: 

(1) Note that the theorem may require larger S for larger p, even though in many cases 
the transition will occur uniformly in S ^ 1 once p is sufficiently large. The tran- 
sition temperature (3 t will generally depend on p and S. 
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(2) There are several reasons why Theorem 5.7 has been stated only for polynomial 
interactions. First, while the upper symbol is easily — and, more or less, unambi- 
guously — defined for polynomials, its definition for general functions may require 
some non-trivial limiting procedures that have not been addressed in the literature. 
Second, the reduction to the classical model, cf Corollary 3.2, requires that the 
classical interaction be Lipschitz, which is automatic for polynomials but less so 
for general power series. In particular, Theorem 5.7 does not strictly apply to non- 
smooth (or even discontinuous) potentials even though we believe that, with some 
model-specific modifications of the proof of Theorem 3.1, we could include many 
such cases as well. 



5.4. Order-by-disorder transitions: Orbital-compass model. We begin by the easier of 
the models (4-5), the 2D orbital compass model. We stick with the reflection-positive 
version of the Hamiltonian which, on Tj,, is given by 

h l = -s- 2 E # a) sl°L> ( 5 - 23 > 

reT L a=x,z 

with e x , e y , e z denoting the unit vectors in (positive) coordinate directions. The num- 
ber B will only be determined later, so we define the good events for general B. 
Given k > (with k <C 1), let Q x be the event that all (classical) spins on a B x B 
block Ab satisfy 

\Q r -q x \> cos(k). (5.24) 
Let Q z be the corresponding event in the z spin-direction. Then we have: 

Theorem 5.12 (Orbital-compass model). Consider the model with the Hamiltonian 
as in (1.4). For each e > there exist k > 0, (3q > and c > and, for each f3 
with po < (3 < cVS, there is a positive integer B and two distinct, translation-invariant 
KMS states (— )^ and (— )^ such that 

(Qg a ) ( p } > l-c, a = x,z. (5.25) 

In particular, for all [3 with /3 < (3 < cVS, 

((S r -e a ) 2 ) { f f>S 2 (l-e), a = x,z. (5.26) 

The proof is an adaptation of the results from [5-7] for the classical versions of 
order-by-disorder. Let B = (Q x U Q z ) c denote the requisite bad event. By definition, B 
is invariant under reflections of (classical) spins through the rrz-plane; i.e., a(B) = B. 
Since the restrictions from B are uniform over the sites in A B , we have #t(B) = 
TBt(B). So, in light of our general claims from Sect. 5.1, to apply the machinery lead- 
ing to Proposition 3.9, it remains to show that pL_p(B) is small if (3 3> 1 and the 
scale B is chosen appropriately. For that let H°°(Q) denote the classical version of the 
Hamiltonian (5.23). By completing the nearest-neighbor terms to a square, we get 

H °°M = \ E E - + E t^ } ] 2 - (5.27) 

reT L a=x,z rGTt 
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Here fi r denotes the a-th Cartesian component of fi r . 

Unforuntately, the event B is too complex to allow a direct estimate of pL,p(B)- 
Thus, we will decompose B into two events, Be and B$w depending on whether the 
"badness" comes from bad energy or bad entropy. Let A > be a scale whose size will 
be determined later. Explicitly, the event Be marks the situations that either 

\fi^\> Cl A (5.28) 

for some site r £ Ab, or 

\fi r a) -n r %J>c 2 A/B, 

for some pair r and r + e a , both in Ab- Here c\, c 2 are 
momentarily. The event £>sw is simply given by 

B SW = B\B E - (5.30) 

By the subadditity property of p L ,p, we have pj^S) < pL,p(B E ) + Pl,p(Bsw)- 

Since Be implies the existence of an energetically "charged" site or bond with energy 
about { a /b) 2 above its minimum, the value of Pl,p{Be) is estimated relatively easily: 

PlABe) < c/3B 2 e -W B ', (5.31) 

for some constants c and c. (Here cB 2 accounts for possible positions of the "excited" 
bond/site and (i comes from the lower bound on the classical partition function.) 

As to Z?sw, here we will decompose further into more elementary events: Given a 
collection of vectors wi , . . . , w s that are uniformly spaced on the first quadrant of the 
main circle, J^++ = {fi e S" 2 ■ O ■ e y = 0, fi^ > 0, fi^ > 0}, we define to 
be the set of configurations in B$w such that 

\fi r x) ■ wf } | + \fi r z) ■ w, (z) | > cos(Z\), r e A B - (5.32) 

Since Bsw is disjoint from Be, on Bsw the y-component of every spin is less than 
order A and any neighboring pair of spins differ by angle at most A (up to a reflection). 
Hence, by choosing c\ and c 2 appropriately, any two spins in Ab will differ by less 
than A from some Wj, i.e., 

s 

Ssw C |J B®, (5.33) 

i=l 

provided that sA exceeds the total length of To estimate p L,p(B^) we will have 

(i) 

to calculate the constrained partition function for the event B s -^. The crucial steps of 
this estimate are encapsulated into the following three propositions: 

Proposition 5.13. Consider the classical orbital compass model with the Hamiltonian 
H°°(fi) as in (5.27) and suppose that A <C 1. Then for alii = 1, . . . , s, 

Pl.A4w) < 2 2B e- s2 ( F ^^)- F ^( Sl », (5.34) 

where, for each w e ^ ++ = {v G ,¥ 2 : v • e 2 = 0, v (x) > 0, v (z) > 0}, 

f,,m - -A*L^° ( n;u,.^.,). 

(5.35) 



(5.29) 

constants to be determined 
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Proposition 5.14. For each e > there exists 8 > such that if 

(3 A 2 > i and f3A 3 < 5, (5.36) 
o 

then for all L sufficiently large, (w) — F(w) \ < e holds for any w <E with F 

given by 

F W = o/ T^log^Cw). (5.37) 
HereD k (w) = w 2 z \l - e ifel | 2 + w 2 x \l - e ik2 \ 2 . 

Proposition 5.15. 77ze function w i— > F(w) is minimized (only) by vectors w = ie^ 
andw = ±e z . 

The proofs of these propositions consist of technical steps which are deferred to the 
Appendix. We now finish the formal proof of the theorem subject to these propositions: 

Proof of Theorem 5.12 completed. As already mentioned, the bad event is invariant 
under both spatial reflections 9 t and the "internal" reflection cr; hence dt (B) = Tst (&) 
as desired. Second, if two distinct good events occur in neighboring blocks, say A b 
and Ab + Bs\, then at least one of the bonds between these blocks must obey (5.29); 
i.e., the box As + ei is (energetically) bad. Third, we need to show that pL,p(B) is 
small. We will set A and B to the values 

A = 0~& and Bk log/3. (5.38) 

These choices make Pl,p{Be) small once (3 is sufficiently large and, at the same time, 
ensure that (5.36) holds for any given S. Since we have (5.34), Propositions 5.14-5.15 

and the fact that B^l, being a subset of B, is empty when Wj is within, say, V2 of ±e x 
or ±e z tell us that 

PlABsw) < se-i eB " (5.39) 
once B is sufficiently large. But s is proportional to Vz\ and so this is small for (3 
sufficiently large. We conclude that as (3 — > 00, we have pL,p{B) — » for the above 
choice of B and A. 

Having verified all required conditions, the xz-symmetry of the model puts us in 
a position to apply Proposition 3.9. Hence, for all sufficiently large (3, there exist two 
infinite-volume, translation-invariant KMS states (— )^ and {—)^ such that (5.25) 
holds. To derive (5.26), we note that, for any vector w e y 2 and any single-spin coher- 
ent state I Q) 

S -w\Q) =S{w ■ Q)\Q) +0(VS). (5.40) 

Hence, (S ■ &k) 2 Qg k — S 2 Qg h + 0(S 3 ^ 2 ), where all error terms indicate bounds in 
norm. Invoking (5.25), the bound (5.26) follows. □ 

Remark 5.16. The 3D orbital-compass model is expected to undergo a similar kind of 
symmetry breaking, with three distinct states "aligned" along one of the three lattice di- 
rections. However, the actual proof — for the classical model, a version of this statement 
has been established in [7] — is considerably more involved because of the existence 
of (a large number of) inhomogeneous ground states that are not distinguished at the 
leading order of spin- wave free-energy calculations. We also note that an independent 
analysis of the classical version of the 2D orbital-compass model, using an approach 
similar to Refs. [6,7] and [41], has been performed in [40]. 
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5.5. Order-by-disorder transitions: 120-degree model. The statements (and proofs) for 
the 120-degree model are analogous, though more notationally involved. Consider six 
vectors Vi , . . . , v 6 defined by 

vi=e x , v 2 = ^e x + ^e 2 , v 3 = -\t x - ^e 2 (5.41) 
v 4 = -e x , v 5 = -±e x - ^e z , v 6 = ie^ - ^e z . (5.42) 

As is easy to check, these are the six sixth complex roots of unity. The reflection-positive 
version of the Hamiltonian on then has the form 

H = -S- 2 E ( S r ■ V2 a )(S r+ , a ■ V 2o ), (5-43) 

r£T t a=l,2,3 

where ei, e 2 , e 3 is yet another labeling of the usual triplet of coordinate vectors in 1? . 
To define good block events, let k > satisfy k«1 and let Gi, ■ ■ ■ , Qe be the _B-block 
events that all spins f2 r , r E As, are such that 

&r ■ v « > cos(k), a = 1, ... ,6, (5.44) 

respectively. Then we have: 

Theorem 5.17 (120-degree model). Consider the 120-degree model with the Hamilto- 
nian (5.43). For each e > there exist k > 0, /3o > and c > and, for each (3 
with Po < P < cVS, there is a number B and six distinct, translation-invariant 
states (—)^"\ a = 1, ... ,6, such that 

(Qg a ) { ^ > l-c a = l,...,6. (5.45) 

/n particular, for all [3 with f3 < [3 < cVS, 

(S r ■v a )J ,) >5(l-e), a=l,...,6. (5.46) 

Fix k > (with k < 1) and let B and Z\ be as in (5.38). Let B = (Q 1 U • • • U £ 6 ) c 
be the relevant bad event. It is easy to check that B is invariant with respect to a and, 
consequently, -dt{B) = B for all r e T L / B as required. Introducing the projections 

n^=f2 r -w a , a = l,...,6, (5.47) 
and noting that, for any vector w e J^ 2 , 

2 (w.v Q ) 2 = ^[l-(w.e,) 2 ], (5.48) 

a=l,2,3 

the classical Hamiltonian H°°(fi) can be written in the form 

H°°{n) = \ £ E (^ 2Q) - ^l) 2 + I E («r -e,) 2 - §|T L |. (5.49) 

r£T L a=l,2,3 r£T L 

As for the orbital-compass model, we will estimate pL,p (B) by further decomposing B 
into more elementary bad events. 
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Let Be denote the event that the block Ab contains an energetically "charged" site 
or bond. Explicitly, Be is the event that either for some r e Ab we have 

|«r-e„|>ci4. (5-50) 
or, for some nearest-neighbor pair (r,r + e a ) in Ab, we have 
i i ^ 

\ftr ■ V2a - H r+ia ■ W 2a \ > C 2 — ■ (5.51) 

Here c\ and c 2 are constants that will be specified later. The complementary part of B 
will be denoted by Bsw, i- e -> 

B S w = B\B E . (5.52) 

By the fact that Bsw C B E , on Bsw the energetics of the entire block is good — i.e., 
the configuration is near one of the ground states. Clearly, all constant configurations 
with zero y-component are ground states. However, unlike for the 2D orbital-compass 
model, there are other, inhomogeneous ground states which make the treatment of this 
model somewhat more complicated. Fortunately, we will be able to plug in the results 
of [6] more or less directly. 

As for the orbital-compass model, to derive a good bound on p l,/3 (Bsw) we will fur- 
ther partition Bsw into more elementary events. We begin with the events corresponding 
to the homogeneous ground states: Given a collection of vectors Wj, i = 1, . . . , s, that 

are uniformly spaced on the circle C 5? 2 in the xz-plane, we define B^ to be the 
subset of Bsw on which 

Q r ■ Wj > cos(Z\), r e A B . (5.53) 

To describe the remaining "parts of Bsw," we will not try to keep track of the en- 
tire "near ground-state" configuration. Instead, we will note that each inhomogeneous 
ground state contains a pair of neighboring planes in Ab where the homogenous config- 
uration gets "flipped" through one of the vectors vi , . . . , %. (We refer the reader to [6], 
particularly page 259.) Explicitly, given a lattice direction a = 1, 2, 3 and a vector w € 
5?\, let w* denote the reflection of Wj through v 2a -i. For each j = 1, . . . , B — 1, we 

then define B^- to be the set of spin configurations in Bsw such that for all r e Ab, 

Q r ■ w 4 > cos(A) if r-e a =j, 

„ (5.54) 
fl r ■ Wj > cos(Z\) if r ■ e a = j + 1. 

(Note that r • e Q = j means that the a-th coordinate of r is j. Hence, on B„ j, the 
spins are near Wj on the j-th plane orthogonal to e Q and near w* on the j + 1-st plane 
in Ab-) The conditions under which these events form a partition of B is the subject of 
the following claim: 

Proposition 5.18. Given k > 0, there exist c\ , c 2 > such that if Be and Bsw are 

defined as in (5.50-5.52) and if A and B are such that BA <C k -C 1 and sA > An, 
then 

SswC[j(fi«U IJ tj B S) (5 ' 55) 

i=l ^ a=l,2,3 j = l ' 
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Next we will attend to the estimates of pL,& for me various events constituting B. 
As for the orbital-compass model, the event Be is dismissed easily: 

PlABe) <c(3B 3 e-^ A2 / B \ (5.56) 
where c and c are positive constants. As to the events Bq \ here we get: 
Proposition 5.19. For each n > there exists 8 > such that if (3 and A obey 

(3A 2 > \ and f3A 3 < 8, (5.57) 
o 

then for all L sufficiently large, 

PlAbP) < e-^W, i = l,...,s. (5.58) 

Here pi (k) > for all k«1. 

For the "inhomogeneous" events the decay rate is slower, but still sufficient for our 
needs. 

Proposition 5.20. For each k > there exists 8 > such that if (3, A and 8 obey (5.57), 
then for all j = 1, . . . , B — 1, all a = 1, 2, 3 and all L sufficiently large, 

PlA^)<^ b2p2{k \ i = l,...,s. (5.59) 
Here pi{n) > for all k<1. 

Again, the proofs of these propositions are deferred to the Appendix. 

Proof of Theorem 5.17 completed. We proceed very much like for the orbital com- 
pass model. The core of the proof again boils down to showing that pL,p(B) is small, 
provided B is chosen appropriately. Let A and B be related to /3 as in (5.38). By 
(5.56), this choice makes Pl,p{Be) small and, at the same time, makes (5.57) eventu- 
ally satisfied for any fixed 8 > 0. Invoking Propositions 5. 19-5.20, and the subadditivity 
of A h-» pL, 13(A), we have 

Pl !/3 (£sw) < s(e- B3 >>^ + 3Be- B2 ^) (5.60) 

which by the fact that s = 0(A^ 1 ) implies Pl./jC^sw) <C 1 once (3 is sufficiently 
large. Using that PlA b ) < Pl,p(B e ) + pL,p( B sw)> the desired bound p L A B ) < 1 
follows. 

It is easy to check, the bad event B is preserved by "complex conjugation" a as well 
as reflections and so the # t 's act on it as mere translations. Moreover, once k <C 1, 
if two distinct types of goodness occur in neighboring blocks, all edges between the 
blocks are of high-energy — any block containing these edges is thus bad. Finally, the 
model on torus is invariant under rotation of all spins by 60° in the xz-plane. This means 
that all conditions of Proposition 3.9 are satisfied and so, for (3 » 1 and S 3> [3 2 , the 
quantum model features six distinct states obeying (5.45). From here we get (5.46). □ 
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6. Appendix 

This section is devoted to the proofs of various technical statements from Sects. 5.3, 5.4 
and 5.5. Some of the proofs in the latter two subsections are based on the corresponding 
claims from [6,7]. In such cases we will indicate only the necessary changes. 



6.1. Technical claims: Large-entropy models. Consider a sequence (c' p ^) satisfying 
the large-entropy property and assume, without loss of generality, that ||c' p ^|| = 1 for 
all p > 1. Our goal here is to provide the bounds on pi ;j g(S) and the asymptotic state- 
ments concerning the dominance of the two types of goodness which were claimed in 
Propositions 5.9 and 5.10. We begin with a lower estimate on the full partition function. 

Lemma 6.1. Let t > be fixed. Then there exists p\ < oo and constants C\,C2 € 
(0, oo) such that for all p > p\ and all (3 > 0, 

liminf {Z L ) 1/Ld > ma^{c 1 e p e 0dA " {t \c 2 }. (6.1) 

L — >oc 

Proof. We will derive two separate bounds on the partition function per site. Focussing 
on the cases when f2 r o f2 r / involves all three components of the spins, let us restrict 
attention to configurations when every spin is within angle c^/e^ of the vector (0, 0,1), 
where c is a constant to be determined momentarily. Let £1 and fi' be two vectors with 
this property. Then the (diamond) angle between J? and J?' is less than 2c^fe^ and so 

Q o Q' > cos(2c^) > 1 - 2c 2 e p . (6.2) 

Choosing 2c 2 = t, we thus have Q o Q' > 1 — te p . This means that the energy of any 
bond in the configuration obeying these constraint is at least A p (t); while each spin has 

at least 1 — cos(cy^) w \(?e p surface area at its disposal. This implies that (Zl) 1 ^ 
is bounded by the first term in the maximum with ci w \<? . The other interpretation 
of Q r o Q r i is handled analogously. 

In order to derive the second bound, we will restrict all spins to a sector of angular 
aperture V2, e.g., the one described as {{2 = (r? 1 , J7 2 , fl 3 ) e y 2 : H 1 > l/\/2}. This 
has area a which is a fixed positive number. Moreover, the constraint ensures that the 
interaction between any two spins is non-positive; the partition function per site then 
boils down to the entropy of such configurations. To evaluate this entropy, we fix the 
configuration on the even sublattice. Every spin on the even sublattice is then presented 
with 2c? "spots" on this sector which it must avoid. The area of each such spot is a 
constant times e p . It follows that (Zl) 1 ^ > a — 0(e p ) which is positive once p is 
sufficiently large. □ 

Our next bound concerns the constrained partition function Z™ lx (£) obtained by 
disseminating a particular pattern C of ordered and disordered bonds (i.e. energetically 
and entropically good bonds) over the torus, when £ is a genuine mixture of the two. 
That is, we assume that C contains bonds of both phenotypes. We remark that this dis- 
semination is carried out by means of reflections in planes of sites (which is permissible 
by the nearest-neighbor nature of the interaction). Recall that = d2 d ~ 1 is the number 
of bonds entirely contained in the 2 x • • • x 2 block Ab- 
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Lemma 6.2. Let t > be such that 

1 - (1 - b)/a d 
A p (t) 

and 

def 



< 1 (6.3) 



[1 11 6 1 
A = mm\ 1 + — -, — - \ > 0. (6.4) 

[ a d A p (t) a d Ap(t) J 

Then there exists a constant c 3 < oo such that for any (3 > and any pattern C of 
ordered and disordered bonds (i.e. energetically and entropically good bonds) on Ab 
containing at least one bond of each phenotype, 

limsup Zf x (C) 1/Ld < c 3 max{c 1 e )) e^ (t) ,c 2 } (e p ) A . (6.5) 

L^oo 

Proof. Fix a pattern C as specified above. As usual, we call a bond disordered if it 
is entropically good. Let /& denote the fraction of disordered bonds in pattern L. Let 
us call a vertex an "entropic site" if all bonds connected to it are disordered. (Note 
that this has two different, but logically consistent, connotations depending on whether 
we are speaking of a vertex in Ab or in T^.) Let f s denote the fraction of entropic 
sites in L. Upon dissemination (by reflections through planes of sites), these numbers 
fb and f s will represent the actual fractions of disordered bonds and entropic sites 
in Tl, respectively. Now each disordered bond has an energetic at most b, while we 
may estimate the energy of each ordered bond by 1. For each entropic site we will 
throw in full measure so we just need to estimate the entropy of the non-entropic sites. 
Here we note that each ordered bond disseminates into a "line" of ordered bonds, upon 
reflections. If we disregard exactly one bond on this "line of sites", then we see that 
there is a total measure proportional to 0(e p ^ 1 ). Since this entropy is shared by the L 
vertices on this line, the entropy density of each vertex on this line is 0(e p ) in the 
L — > oo limit. This is an upper bound for the entropy density for each non-entropic site. 
The bounds on energy show that the Boltzmann factor is no larger than 

e l3d{i-h)+f3dbh =e Mi-(i-6)A]. (6.6) 
We thus conclude that, for some constant 63, 

limsup Z L (Cf' Ld < ^(cp) 1 -^ 1 -* 1 " 6 )'*]. (6.7) 

L — >oo 

Now, we may write the right-hand side as 

i-(i-fe)/b 

5 3 (e p eW) AAt) (e p ) A ^ (6.8) 

where 

Since C contains at least one entropic bond, we know fb > l/a d . Our choice of t 
guarantees that 1 — (1 — 6) fb < 1 — (1 — b)/a d < A p (t) and so the complicated exponent 
in (6.8) is bounded by 1. We may use the famous identity X X Y 1 ~ X < max(X, Y), true 
whenever X, Y > and < A < 1, to bound the term with the complicated power 
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in (6.8) by the maximum in (6.5). (We set X — ae p e^ dAp ^ and Y = c 2 , absorbing 
extra order- 1 constants into our eventual C3.) It remains to show that A(C) exceeds A 
in (6.4) whenever C contains both pheno types of bonds. 

We will derive a relation between f s and ft, that holds whenever C contains both phe- 
notypes of bonds. We may give the argument in either picture — where we restrict to the 
small block Ab or where we consider the full torus Tj, after disseminating C — which 
are entirely equivalent since the fractions of entropic bonds and sites are the same. We 
will give the argument in the small 2 x • • • x 2 block Ab- Since C contains bonds of 
both phenotypes there are at least two vertices in Ab each of which "emanates" bonds 
of both phenotypes. We mark these sites, and for each of them we mark one of the inci- 
dent entropically good (disordered) bonds. We now consider the bonds of Ab to be split 
into half-bonds each of which is associated to the closest incident vertex (disregarding 
the midpoints). We label each half-bond as entropic or energetic, according to whether 
it is half of a full bond which is entropically or energetically good. 

Let H be the total number of entropic half-bonds. Now note that for each entropic 
vertex, all d of the half-bonds emanating from it (and contained in Ab) are "entropic 
half-bonds". We also have at least two additional entropic half-bonds associated to the 
two marked sites. Therefore the number of entropic half bonds satisfies the bound H > 
d2 d f s + 2. (Note that there are 2 d f s entropic sites.) Since there are 2a,d = d2 d total 
half -bonds in Ab, the proportion of entropic half bonds is at least f s + l/a^. At this 
point let us observe that the proportion of entropic half -bonds is exactly the same as the 
proportion of entropic full-bonds, f b . Therefore 

f b >fs + ~- (6.10) 
ad 

Plugging this into the formula for A(C) we thus get 

A(C)>l + ^-f b - 1 - i '-. b)fb . (6.11) 
ad An*) 

Allowing ft to take arbitrary values in [0, 1], the right-hand side is minimized by one 
of the values in the maximum in (6.4). Hence, A(C) > A whereby (6.5) follows. □ 

Proof of Proposition 5.9. As usual, we consider events disseminated by reflections in 
planes of lattice sites. Let bn < — ^ — . If b < bn, then, as a calculation shows, the 

r u l+a d _ u> > 

bound (6.4) holds as well as (6.3) for t such that A p (t) > 1 — b. Such a t can in turn 
be chosen by the assumption that the model obeys the large-entropy condition. (This is 
where we need that p is sufficiently large.) Hence, the bound in Lemma 6.2 is at our 
disposal. Now the maximum on the right-hand side of (6.5) is a lower bound on the 
full partition function per site; the lemma thus gives us bounds on phfi of the events 
enforcing the various patterns on Ab- Since B can be decomposed into a finite union of 
such pattern-events, the desired inequality (5.20) follows. □ 

Proof of Proposition 5.10. Again we work with events disseminated using reflections 
in planes of sites. In order to prove (5.21), we note that <& p (fi r o Q r i) > b — which 
is what every bond (r ,r') in Ab satisfies provided J? e Q or a — implies Q r o fi r > > 
1 — ce p . The neighboring spins are thus constrained to be within angle 0(y/e^) of each 
other. Disregarding an appropriate subset of these constraints (reusing the "line of sites" 
argument from the first part of the proof of Lemma 6.2) the desired bound follows. 

To prove (5.22), we note that the disseminated event C?dis forces all bonds to have en- 
ergy less than b. Lemma 6. 1 implies that the corresponding p ^-functional is bounded 
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above by C\ (e p ) 1 e /3d [ fc . Assuming that b < V2 and t is chosen so that A p (t) - 

b > V2. we see that if is large enough to satisfy 

e /3d > c -2(l+4 l)} (612) 

then the bound is less than C\ (e p ) Al . □ 

Given the existing results on the discontinuity of energetic bonds, it is almost in- 
conceivable that the energy density itself could be continuous. To mathematically rule 
out this possibility, we will show that, in actuality very few of the energetic bonds have 
value in the vicinity of b. So while the previous argument only considered two types of 
bonds, we will henceforth have the following three types of bonds: 

(1) strongly ordered if <& p {Q r o f2 r >) > 1 — b', 

(2) weakly ordered if 1 — b' > (£ p (f2 r o £2 r i) > b, 

(3) disordered if (£ p (f2 r o Q r i) < b. 

Here < b',b < V2 are constants which we will choose later, although we already 
know that we have the restriction b < 1/(1 + ad) as was necessary in the proof of 
Proposition 5.9. A rather similar line of argument to that previously used for mixed 
patterns of ordered and disordered bonds handles the situation for mixed patterns of 
weak and strong order. For each pattern C of weakly and strongly ordered bonds on 
Ab, let Z°l d (C) denote the partition function obtained by disseminating C all over the 
torus. Then we have: 

Lemma 6.3. Let t > be a number such that 

There exists a constant C4 < 00 such that for any (3 > and any pattern L of weakly 
and strongly ordered bonds on the 2 x • • • x 2 block Ab containing at least one weakly 
ordered bond, 

limsup Zf{C) 1/Ld < c 4 max{cie p e^W,c 2 } {e p ) A> . (6.14) 

L — >oo 



Proof. Consider an ordered pattern L with fraction /„, of weakly ordered bonds. After 
dissemination all over Tj,, there is a fraction /„, of bonds on that are weakly ordered 
and a fraction fraction 1 — /„, that are strongly ordered. Putting energy 1 — 6' for each 
weakly ordered bond and 1 for each strongly ordered bond, the Boltzmann weight of 
any spin configuration contributing to Z£ rd (£) is at most 

To calculate the entropy, we again use the "line of sites" argument from the first part of 
the proof of Lemma 6.2, which gives an entropy per site on the order of 0(e p ) in the 

L — > 00 limit. This implies that the limsup of Z£ rd (£) 1 / i is bounded by a constant 
times e p e l3d ( 1 - h ' 'M . Since 1 - b' f w < 1 - b'/a d we get 

l-b'/a d 

limsup Zl\Ctl Ld < cJe p ^ dA ^A ApW (e p ) A \ (6.16) 
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for some constant C4 < 00. By (6.13), the exponent of the term e p t tidAr ^ is less than 1 
and so the famous identity, X X Y 1_ A < max{X, Y}, may be used again (as in the proof 
of Lemma 6.2) which readily yields the bound (6. 14). □ 

Proof of Corollary 5.8. The proof is based on thermodynamical arguments. First, stan- 
dard calculations using coherent states show that 

£ p (S- 2 (S r oS r ,))\Q) = £ p {Q r oQ r ,)\Q) + 0{1/VS) (6.17) 

where the error term depends implicitly on p. Hence, for a given p and S > 0, we can 
find S so large that for any r , r ' e Ab 

(n\<£ p (s- 2 (s r o s r .))Q A \n) f>i-v-s, if A = g OIdl 
(n\Q A \n) \<b + s, if A = g dis . 

(At the classical level the second case is by definition, whereas the first case follows 
from Lemma 6.3.) Since (3 1— » e(/3) is increasing, we conclude that (5.19) holds. As a 
technical point, we note that in the statement of the corollary we did not include the 
small corrections corresponding to 5 > 0. This was primarily for aesthetic reasons: we 
wanted to state the simplest possible result. We can clearly accomplish this by taking b 
and b' to be a little smaller than is otherwise needed. □ 



6.2. Technical claims: Orbital-compass model. Here we will prove Propositions 5.13- 
5.15 concerning the orbital-compass model. The proofs follow the strategy developed 
in the context of the 120-degree model [6]. 

Proof of Proposition 5.13. The proof goes by one more partitioning of B^. Consider a 

spin configuration J? = (Q r ) r eT L *= ^sw- Since B^ C Z?sw an d ^ <S 1, it is easy to 
check the following facts: 

(1) the y-components of all spins in Ab are small. 

(2) the x-components of the spins along each "line of sites" (in Ab) in the rr-direction 
are either all near the x-component of vector Wj or its negative. 

(3) same is true for the z-components of the spins on "lines of sites" in the z lattice 
direction. 

Thus, at the cost of reflecting the x-components of spins along each "line of sites" in 
the x-direction, and similarly for the z-components, we may assume that all spins are 
aligned with Wj in the sense that 

fl r ■ Wj > cos(Zi), r e A B . (6.19) 

Let denote the set of configurations satisfying (6.19). The above reflection pre- 

serves both the a priori measure and the Hamiltonian (5.27); the event B^ is thus 

partitioned into 2 2B "versions" of event B$^ all of which have the same value of pL,p- 
functional. Invoking the Subadditivity Lemma, (5.34) is proved once we show that 

PlAB ( w } ) < e- s2 ^.-^)- F ^( 6l ». (6.20) 
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This follows by noting that e B Fl >^( Wi ) is, to within a convenient multiplier, the in- 
tegral of the Boltzmann weight Q-P H °°( n ) on the event £>^,°' while e~ s2 Fi .^( ei ) pro- 
vides a lower bound on the partition function (again, to within the same multiplier which 
thus cancels from the ratio). □ 

Proof of Proposition 5.14. The principal idea is to derive upper and lower bounds on 
Fl,a{w) which converge, in the limit L — > oo, to the same Gaussian integral. Let us 
parametrize w e as (cos 0+, 0, sin 0*) and, given a spin configuration fi that sat- 

isfies fl r ■ w > cos(Z\) for all r e Tl, let us introduce the deviation variables (i? r , ( r ) 
by the formula 

Q r = (y/l - CI cos(0* + r ), Cr , v 7 ! - sin(0* + r )). (6.21) 

Noting that both -d r and £ r are order A, we derive that H°°{Q) + |Tj,| is, to within a 
quantity of order i 2 Z\ 3 , equal to the quadratic form 

A,wW0 = ^ E - ^+ej 2 (0 P - 1?r.+eJ 2 } + ]T Cr (6-22) 

The Jacobian of the transformation f2 r (i? r , £ r ) is unity. 

Next we will derive upper and lower bounds on the integral of e~ l3 ' yL - i against the 
product of indicators in (5.35). For the upper bound we invoke the inequality 



II W>cc.<4)} < e**"^' exp{-^ £ ^.23) 



\0 



valid for each A > 0. The £ r 's are then unrestricted and their integrals can be per- 
formed yielding a factor y / 2ir//3 per integral. The integral over -d r 's involves passing 
to the Fourier components, which diagonalizes the covariance matrix. The result is best 
expressed in L — > oo limit: 

liminf F L _ A (w) > 0(/3Z\ 3 ) + -Xf3A 2 + F(X,w), (6.24) 

L^oo 2 

where 

nA,w) = -/ log[A + ^ fc (w)] (6.25) 

By the Monotone Convergence Theorem, F(X, w) converges to F(w) as A J. 0. Since 
(3 A 3 is less than 6, which is up to us to choose, taking A J. on both sides of (6.24) we 
deduce that Fl,a{w) > .F(w) — e for L sufficiently large. 

It remains to derive the corresponding lower bound. Here we will still work with the 
parameter A above but, unlike for the upper bound, we will not be able to take A J, at 
the end. Consider the Gaussian measure P\ which assigns any Borel set ylc (Kx M.) Tl 
the probability 

(6.26) 
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Let E\ denote the corresponding expectation. From f3\ > we get 

f dfl e-MAW f Yl 1 {rv *> cos(4)} ) >Z L (\)Ex( [] 1 { fl r .*>co.(4)}). 
J(.y 2 y' 1 ^ Veil ' VeTj, ' 

(6.27) 

The free-energy corresponding to the normalization constant Zl(A) is exactly F(X, w) 
above. Thus, given e > 0, we can find A > such that Z L (\) > e -L 2 [F(«)+e/2] 
once L > 1. It remains to show that the expectation is at least er L c / 2 provided S in 
(5.36) is sufficiently small. 

Here we first decrease the product by noting that 

This decouples the ( r 's from the i9 r 's and allows us to use the independence of these 
fields under P\. Since the £ r 's are themselves independent, the integral over Q r boils 
down to 

e 4 n i {ic.i<-/ 2 }) = n MKr\<^) > (i-*-w*"f, (6.29) 

where we used the standard tail bound for normal distribution. Note that, for any fixed 
A > 0, the term 1 — e~ A,3zl / 4 can be made as close to one as desired by increasing (3 A 2 
appropriately. 

The fl r 's are not independent, but reflection positivity through bonds shows that the 
corresponding indicators are positively correlated, i.e., 

e 4 n i {hM<-/ 2 >) ^ n MWr\< A h). (6.30) 

The probability on the right-hand side is estimated using a variance bound: 

ftflM > «,) < (|)W) - ^E^^ w (6J1) 

where T£ denotes the reciprocal torus. Again, for any fixed A, P\(|$r| < ^2) can 
be made as close to one as desired once f3A 2 is sufficiently large. We conclude that, 
given e > 0, we can choose 8 such that F L ^(w) < F(w) + e once L » 1. This 
finishes the proof. □ 

Proof of Proposition 5.15. Since w^ + w 2 z = 1, this is a simple consequence of Jensen's 
inequality and the strict concavity of the logarithm. □ 



6.3. Technical claims: 120-degree model. Here we will provide the proofs of technical 
Propositions 5.18-5.20. The core of all proofs is the fact that any spin configuration 
{fl r ) can be naturally deformed, by rotating along the main circle orthogonal to the xz- 
plane, to have zero y-component. An explicit form of this transformation is as follows: 
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Let us write each fl r e ,5^2 using two variables ( r e [—1,1] and 9 r e [0, 27r) inter- 
preted as the cylindrical coordinates, 

fir = ( Vl-Cr C0S d r,(r, y/ ± ~ Q sin ^ ) • (6.32) 

Then J2£ is the vector in which we set ( r = 0, i.e., 

J? p = (cos0 r ,O,sin0 P ). (6.33) 

(We have already used this transformation in the proof of Proposition 5.14.) An ad- 
ditional useful feature of this parametrization is that the surface (Haar) measure dfl r 
on S^i then decomposes into the product of the Lebesgue measure dfl' r on 5?\ and the 
Lebesgue measure d^ on [— 1 , 1] . 

Proof of Proposition 5.18. We will use the fact that, for configurations on As with 
vanishing component in the y-direction, this was already proved as Theorem 6.4 in [6]. 
Let {fi r ) G £>sw and define {fl' r ) be as above. Since \fl r -e y \ < c\AjB for all r e A B , 
we have 

\{fl r - fl' r ) -z y \ < c^A/B (6.34) 

while 

{fl r - fl' r ) -e Q = 0{A 2 /B 2 ), a = x,z. (6.35) 

In particular, the configuration {fl' r ) is contained in the version of event Ssw from [6], 
provided c 2 is a sufficiently small numerical constant. Thus, under the condition BA <C 
k < 1 — which translates to the condition B\fT < k < 1 of [6, Theorem 6.4] — ) 
is contained in one of the events on the right-hand side of (5.55). But, at the cost of a 
slight adjustment of A, the corresponding event will then contain also {fl r ). □ 

To prove the bounds in the remaining two propositions, we will more or less directly 
plug in the results of [6]. This is possible because the y-component of the spins con- 
tributes only an additive factor to the overall spin-wave free energy. The crucial estimate 
is derived as follows: 

Lemma 6.4. There exists a constant c > such that the following is true: Let A <C 1 
and let fl = {fl r ) be a configuration onT^ such that \fl r ■ & v \ < A 2 and \fl r 2a ^ — 
I — A /or a// a = 1,2, 3. Define fl' = {fl' r ) as above. Then 



H°"{fl)-H°"{fl')-\ Y^iVyZyf 



< cA 3 L 3 . (6.36) 



Proof. By the fact that fl r • e y = 0{A) we have 

fl r -y a = fl'r -y a + 0{A 2 ). (6.37) 
But then the assumption fl r 2a ^ — fl r 2 +l a = 0{A) yields 

[{flr - flr+ej ■ V 2 J ' - [{fl'r ~ ^ + ej ' V 2q ] ' + 0{A 3 ). (6.38) 

Using (5.49), this proves the claim. □ 

Proof of Proposition 5.19. The quantity pL,p{Bo^) is the ratio of the partition function 
in which all spins are constrained to make angle at most A with w,, and the full partition 
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function. The restriction B^ C i?sw can > f° r tne most P^ 1 ' be ignored except for 
the Wj's that are close to one of the six preferred directions. In such cases the fact 
that A <C k tells us that Bq is empty whenever the angle between Wj and the closest 
of vi , . . . , v 6 is less than, say, K / 2 . In particular, we may restrict attention to the Wj's 
that are farther than K /2 from any of these vectors. 

Viewing the collection of angles (9 r ) as a configuration of 0(2)-spins, Lemma 6.4 
tells us that the Hamiltonian of (Q r ) is, to within corrections of order L 3 A 3 , the sum of 
I Cr an d tn e Hamiltonian of the classical, 0(2)-spin 120-degree model evaluated at 
configuration (9 r ). Since the measure dfi r equals the product d( r d9 r on the respective 
domain, we may ignore the restriction of ( r to values less than 0(A) and integrate 

the ( r 's. We conclude that p l,p(Bq^) is bounded by the same quantity as for the 0(2)- 
spin 120-degree model times Q°(f 3A - 3 ) . Since (3 A 3 is controlled via (5.57), the desired 
bound follows from [6, Lemma 6.9]. □ 

Proof of Proposition 5.20. The proof is very much like that of the previous proposition. 
Let B^j denote the event that the top line in (5.54) holds for all r G Ab for which r ■ e Q 
is odd and the bottom line for all such r for which r ■ e a is even. Chessboard estimates 
then yield 

pLA^Si)<PLA^Si) a/B - 

On the disseminated event C\teT L /B 9 t (B^j) the assumptions of Lemma 6.4 are satis- 
fied. Hence, we may again integrate out the ( r 's to reduce the calculation to that for 
0(2)-spins. The latter calculation was performed in detail in [6]; the desired bound is 
then proved exactly as Lemma 6.10 of [6] (explicitly, applying inequality (6.24) of [6] 
and the paragraph thereafter). □ 
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